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HIV-1 VACCINES . ANTIBODY COMPOSITIONS RELATED THERETO . AND 
5 THERAPEUTIC AND PROPHYLACTIC USES THEREOF 



Backcrro ""^ the Invention 

10 

Throughout this application, various publications are 
referenced by Arabic numerals. Full citations for these 
references may be found at the end of the specification 
immediately preceding the claims. The disclosure of these 
15 publications is hereby incorporated by reference into this 
application to describe more fully the art to which this 
invention pertains. 

The life cycle of animal viruses is characterized by a 
2 0 series of events that are required for the productive 
infection of the host cell. The initial step in the 
replicative cycle is the attachment of the virus to the cell 
surface, which attachment is mediated by the specific 
interaction of the viral attachment protein (VAP) to 
25 receptors on the surface of the target cell. The 
differential pattern of expression of these receptors is 
largely responsible for the host range and tropic properties 
of viruses. In addition, an effective immune response 
against many viruses is mediated through neutralizing 
30 antibodies directed against the VAP. The interaction of the 
VAP with cellular receptors and the immune system therefore 
plays a critical role in infection and pathogenesis of viral 
disease. 

35 The human immunodeficiency virus type 1 (HIV-l) infects 
primarily helper T lymphocytes , dendritic cells, and 
monocytes /macrophages --cells that express surface CD4-- 
leading to a gradual loss of immune function. This loss of 
function results in the development of the human acquired 
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immunodeficiency syndrome (AIDS) (l) . The initial phase of 
the HIV-1 replicative cycle involves the high-affinity 
interaction between the HIV-1 exterior envelope glycoprotein 
gpl20 and cell surface CD4 (K d approximately 4 x 10" 9 M) (2) . 
5 Several lines of evidence demonstrate the requirement of 
this interaction for viral infectivity. The introduction 
into CD4" human cells of cDNA encoding CD4 is sufficient to 
render otherwise resistant cells susceptible to HIV-1 
infection (3) . In vivo . viral infection appears to be 

10 restricted to cells expressing CD4, indicating that the 
cellular tropism of HIV-1 is largely determined by the 
pattern of cellular expression of CD4 - Following the 
binding of HIV-l gpl20 to cell surface CD4, viral and target 
cell membranes fuse by a mechanism that is poorly 

15 understood, resulting in the introduction of the viral 
capsid into the target cell cytoplasm (4) • 

Mature CD4 has a relative molecular mass (Mr) of 55 kDa and 
consists of an N- terminal 372 -amino acid extracellular 

2 0 domain containing four tandem immunoglobulin- like regions 

(VI -V4) , followed by a 23 -amino acid transmembrane domain 
and a 3 8 -amino acid cytoplasmic segment (5, 6) . In 
experiments using truncated sCD4 proteins, it has been shown 
that the determinants for high-affinity binding to HIV-1 
25 gpl2 0 lie solely within the N- terminal immunoglobulin- like 
domain (VI) (7-9) . Mutational analysis of VI has defined a 
discrete binding site (residues 38-52) that comprises a 
region structurally homologous to the second 
complementarity- determining region (CDR2) of immunoglobulin 

3 0 genes (9) . 



The production of large quantities of sCD4 has permitted a 
structural analysis of the two N- terminal immunoglobulin- 
like domains (V1V2) . The structure determined at 2.3 
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angstrom resolution reveals that the molecule has two 
tightly- associated domains, each of which contains the 
immunoglobulin- fold connected by a continuous beta strand. 
The putative binding sites for monoclonal antibodies, class 
5 II major histocompatibility complex (MHO molecules , and 
HIV-i gpl20, as determined by mutational analyses, map on 
the molecular surface (10, 11) . 



The HIV-i envelope gene env encodes an envelope glycoprotein 
10 precursor, gpl60, which is cleaved by cellular proteases 
before transport to the plasma membrane to yield gpl20 and 
gp41. The membrane -spanning glycoprotein, gp41, is non- 
covalently associated with gpl20,. a purely extracellular 
glycoprotein. The mature gp!20 molecule is heavily 
15 glycosylated (approximately 24 N- linked oligosaccharides) , 
contains approximately 480 amino acid residues with 9 intra - 
chain disulfide bonds (12) , and projects from the viral 
membrane as a dimeric or multimeric molecule (13) . 

20 Mutational studies of HIV-1 gpl20 have delineated important 
functional regions of the molecule. The regions of gpl20 
that interact with gp4l map primarily to the N- and C- 
termini (14) . The predominant strain- specif ic neutralizing 
epitope on gpl20 is located in the 32-34 amino acid residue 

25 third variable loop, herein referred to as the V3 loop, 
which resides near the center of the gpl20 sequence (15) . 
The CD4 binding site maps to discontinuous regions of gpl20 
that include highly conserved or invariant amino acid 
residues in the second, third, and fourth conserved domains 

30 (the C2, C3, and C4 domains) of gpl20 (16). It has been 
postulated that a small pocket formed by these conserved 
> residues within gp!20 could accommodate the CDR2 loop of 

CD4, a region defined by mutational analyses as important in 
interacting with gp!20 (17) . 



35 
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; HIV-1 gpl2 0 not only mediates viral attachment to surface 
CD 4 molecules, but also serves as the major target of 
antibodies which neutralize non- cell -associated virus and 
inhibit cell to cell viral transmission. 

5 

There are two major classifications of HIV-l-neutralizing 
antibodies: type-specific and group-common (15). Type- 
specific neutralizing antibodies primarily recognize linear 
determinants in the highly variable V3 loop of gpl20. These 

10 antibodies act by inhibiting fusion between HIV-1 and the 
target cell membrane, and generally neutralize only a 
particular isolate of, or closely related strains of, HIV-l. 
Sequence variation within the V3 loop, as well as outside of 
this region, permits viruses to escape neutralization by 

15 anti-V3 loop antibodies. In contrast, group- common 
neutralizing antibodies primarily recognize discontinuous or 
conformational epitopes in gpl20, and possess the ability to 
neutralize a diverse range of HIV-1 isolates. These broadly 
neutralizing antibodies often recognize a site on gpl20 

20 which overlaps the highly conserved CD4 -binding site, and 
thus inhibits gpl20-CD4 binding. 

A structural relationship has been demonstrated between the 
V3 loop and the C4 region of gpl20 which region constitutes 
25 both part of the CD4 binding site and part of the conserved 
neutralization epitopes. It was observed that deleting the 
V3 loop resulted in significantly increased binding of a 
panel of broadly neutralizing hMoAbs (neutralizing human 
monoclonal antibodies) to the CD4 binding site (18) . 

30 

A major goal in AIDS vaccine development is to develop a 
vaccine able to protect a subject against the numerous 
genetic variants of HIV-1 that infect humans. Although 
cell -mediated immune responses might serve to control 
35 infection in HIV-1 -infected individuals, several lines of 
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evidence demonstrate that protection against infection is 
mainly mediated by neutralizing antibodies directed against 
gpl20. Early experiments showed that immunization of 
chimpanzees with recombinant gpl2 0 induced a protective 
5 immune response against challenge with the homologous HIV-l 
strain (17) . This protection correlated with the presence 
of high- titer neutralizing antibodies against the V3 loop of 
gpl20. In addition, passive immunization of chimpanzees 
with a V3-loop neutralizing monoclonal antibody resulted in 

10 protection against challenge with the homologous HIV-l 
strain (19) . Although protection against challenge was 
demonstrated in these two experiments, recent studies have 
questioned the clinical relevance of these findings. For 
example, these neutralizing antibodies recognize the V3 loop 

15 determinants of a single strain, and not conserved or 
discontinuous epitopes. Thus, these antibodies lack the 
ability to neutralize the broad spectrum of HIV-l strains 
present in an HIV-l population. Furthermore, the challenge 
virus was the homologous HIV-l laboratory adapted LAI (HTLV- 

20 IIIB) strain and not one of the primary isolates that 
contain considerable gpl20 sequence heterogeneity. Since 
these experiments showed that gpl20 subunit vaccination 
induces an immune response effective against only the 
homogeneous HIV-l strain used as an antigen, it is unlikely 

25 that the vaccination regimens used in these studies would be 
useful in humans. 



Individuals infected by HIV-l typically develop antibodies 
that neutralize the virus in vitro . and neutralization 

3 0 titers decrease with disease progression (19) . Analysis of 
sera from HIV-l -infected humans indicates that type-specific 
neutralizing antibodies appear early in infection. Later in 
the course of infection, a more- broadly neutralizing 
antibody response develops. However this antibody response 

35 is of significantly lower titer and/or affinity. 
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Fractionation studies of HIV-i antibody-positive human sera 
reveal that the type- specif ic neutralizing activity is 
primarily directed against linear determinants in the V3 
loop of gpl20 (20) . There was no correlation found among 
5 antibodies between the ability to neutralize divergent. HIV- 1 
isolates and reactivity to the V3 loop of these isolates. 
In contrast, the broadly neutralizing antibodies present in 
HIV-1 antibody -positive human sera primarly recognize 
discontinuous epitopes in gpl20 which overlap the CD4- 
10 binding site and block gpl20-CD4 binding. In other words, 
the broadly neutralizing activity of neutralizing antibodies 
is not merely the result of additive anti-V3 loop 
reactivities against diverse HIV-1 isolates which appear 
during infection. 

15 

Recently, several groups have generated human monoclonal 
antibodies (hMoAbs) derived from HIV-1 infected individuals 
which possess type- specif ic or group- common neutralizing 
activities (17) . The type-specific neutralizing hMoAbs were 
20 found to recognize linear determinants in the V3 loop of 
gpl20. In contrast, the group- common neutralizing hMoAbs 
generally recognize discontinuous epitopes which overlap the 
CD4 -binding site and block gpl20-CD4 binding. 

The V3 loop is a highly immunodominant region of gp!20 which 
partially interacts with the CD4-binding region. The 
presence of the V3 loop region on gpl20 may skew the humoral 
immune response away from producing antibodies which 
specifically bind to the CD4-binding domain of gpl20. 
Furthermore, the advantages of removing the V3 loop to 
expose the CD4- binding domain of gpl20 to the immune system 
would be countered by the fact that the exposed CD4 -binding 
site would still have a high affinity for cell surface CD4 . 
In other words, a mutant gpl20 protein missing only the V3 
loop would quickly bind to CD4+ cells and would thus be 



30 
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hampered in generating an immune response against the 
exposed CD4 -binding site. 

The subject invention provides a mutant HIV-l gpl20 envelope 
5 glycoprotein which overcomes both the problems of V3 -.loop 
immunodominance and of the high affinity to CD4 . The 
subject invention further provides vaccines comprising the 
mutant HIV-l gpl20 envelope glycoprotein, antibodies which 
specifically bind to the CD4 -binding site of HIV-l gpl2 0 
10 envelope glycoprotein, pharmaceutical compositions 
comprising these antibodies, and methods of using these 
vaccines and compositions to treat or prevent HIV-l 
infection. 



15 
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The subject invention provides a recombinant nucleic acid 
molecule which encodes a mutant HIV-l gpl20 envelope 
5 glycoprotein comprising a V3 loop deletion and a C4 domain^.. 
>X) point mutation, wherein X is an amino acid residue other 
than tryptophan. In the preferred embodiment, X is a valine 
residue. 

10 In one embodiment, the nucleic acid molecule is a DNA 
molecule. The DNA molecule may be a plasmid. In one 
embodiment, the plasmid comprises the sequence of the 
plasmid designated PPI4-tPA. 

15 In one embodiment, the C4 domain is an HIV-1^ gpl20 

envelope glycoprotein C4 domain. The mutant HIV-l gpl20 
envelope glycoprotein may be a mutant HIV-1^ gpl20 envelope 
glycoprotein . 

20 In another embodiment, the C4 domain is an HIV-l^^ gpl20 

envelope glycoprotein C4 domain. The mutant HIV-l gpl20 

envelope glycoprotein may be a mutant HIV-Ijr.fl gpl20 
envelope glycoprotein. 

25 The subject invention also provides the mutant HIV-l gpl20 
envelope glycoprotein encoded by the recombinant nucleic 
acid molecule of the subject invention. 

The subject invention further provides a vaccine which 
3 0 comprises a therapeutically effective amount of the mutant 
HIV-l gpl20 envelope glycoprotein of the subject invention, 
and an adjuvant. 

The subject invention further provides a method of treating 
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an HIV- 1- infected subject, which comprises immunizing the 
HIV- 1 - infected subject with the vaccine of the subject 
invention, thereby treating the HIV- 1- infected subject. 



5 The subject invention further provides a vaccine which 
comprises a prophylactically effective amount of the mutant 
HIV-1 gp!20 envelope glycoprotein of the subject invention, 
and an adjuvant. 

10 The subject invention further provides a method of reducing 
the likelihood of an HIV-1 -exposed subject's becoming 
infected with HIV-1, which comprises immunizing the HIV-1- 
exposed subject with the vaccine of the subject invention, 
thereby reducing the likelihood of the HIV-l-exposed 

15 subject's becoming infected with HIV-l. 

The subject invention further provides a method of reducing 
the likelihood of a non- HIV-l -exposed subject's becoming 
infected with HIV-1, which comprises immunizing the non-HIV- 
2 0 1- exposed subject with the vaccine of the subject invention, 
thereby reducing the likelihood of the non- HIV- l- exposed 
subject's becoming infected with HIV-1. 

The subject invention further provides a method of obtaining 
25 partially purified antibodies which specifically bind to the 
CD4-binding domain of HIV-l gpl20 envelope glycoprotein, 
which method comprises (a) immunizing a non- HIV- 1- exposed 
subject with the vaccine of the subject invention, (b) 
recovering from the immunized subject serum comprising said 
30 antibodies, and (c) partially purifying said antibodies, 
thereby obtaining partially purified antibodies which 
specifically bind to the CD4 -binding domain of HIV-l gpl20 
envelope glycoprotein. In the preferred embodiment, the 
subject is a human. 
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In one embodiment, the subject is a medical practitioner. 
In another embodiment, the subject is a newborn infant. 

5 Finally, the subject invention provides a method of reducing 
the likelihood of a non-HIV-1- exposed subject's becoming 
infected with HIV-1 as a result of exposure thereto during 
an incident wherein there is an increased risk of exposure 
to HIV-l, which comprises administering to the subject 

10 immediately prior to the incident a dose of the composition 
of the subject invention effective to reduce the population 
of HIV-1 to which the subject is exposed during the 
incident, thereby reducing the likelihood of the subject's 
becoming infected with HIV-1. In one embodiment, the 

15 subject is a medical practitioner. 



WO 94/22477 

12 

Brief Description of the Figures 



PCT/US94/03282 



Figure 1 

gpl20 structure . Shown is a box diagram of HIV-1 gpl20 
5 depicting the boundaries of the five constant domains. (Cl- 
C5) and the five variable domains (V1-V5) . The amino acid 
residue numbering above the box begins at the initiator 
methionine found at the beginning of the signal sequence (S) 
and is approximated based on a consensus of all known HIV-l 
10 gpl20 amino acid sequences. Also shown are the C4 domain 
amino acid sequences of HIV-1 strains LAI and JR-FL. Above 
the C4 domain sequences are indicated two mutations that 
reduce gpl2 0 binding to cell surface CD4 ; tryptophan to 
valine and aspartate to alanine. 

15 

Figure 2 

PPI4- tPA- 0012 0 ^. Expression vector with the HIV- 1^ gpl20 
gene fused to the CMV MIE promoter, and the tPA signal 
sequence replacing the HIV-1 gpl20 signal sequence. 

20 Abbreviations: CMV MIE = cytomegalovirus major immediate 
early, E = enhancer, P = promoter, EXA = Exon A, INA = 
Intron A, EXB = Exon B, tPA ss « human tissue plasminogen 
activator signal sequence, gpl20 = glycoprotein 120, BGH = 
bovine growth hormone, AMP - ampicillin resistance gene, and 

25 DHFR « dihydrofolate reductase gene. 

Figure 3 

CMV MIE promoter fused to tPA-qpl20 TA 1 . The nucleotide 
sequence of the CMV MIE promoter/enhancer region is shown 
30 fused to the HIV- lj^, gpl20 gene that contains the tPA signal 
sequence. The numbering of nucleotide sequence begins with 
the Hindi site and the numbering of the amino acid sequence 
begins with the first methionine found in the tPA signal 
sequence. The tPA signal sequence is fused in- frame to Thr 31 
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of gpl20, the first amino acid found in mature gpl20. The 
signal sequence is shown in bold as are various landmark 
restriction sites used for cloning as discussed in the text. 
The locations of Exon A, Intron A, Exon B and the 
5 transcription start site and the signal cleavage site are 
indicated. 

Figure 4 

Transient expression of crol20 . Autoradiograph of 35 S- labeled 
10 supernatants from COS cell transf ectants , immunoprecipitated 
with a CD4- immunoglobulin- Protein A-Sepharose complex, and 
run on a reducing 10% SDS-PAGE gel. Theplasmids used for 
transf ection were: Lane 1: Mock transf ected cells; lane 2: 
a vector encoding a CD4- immunoglobulin chimera as a positive 
15 transfection control; lane 3: PPI4-tPA-gpl20 LA i; and lane 4: 
PPI4- tPA-gpl20jR.PL. Positions of molecular weight markers 
are indicated. 

Figure 5 

20 Determination of opl20 concentration by ELISA . Panel A: 
Concentrations of gpl20 in media of CHO cell lines, stably 
transfected with PPI4- tPA-gpl20 LA1/ determined by ELISA. 
Panel B: A standard curve was established using known 
amounts of gpl20. 

25 

Figure 6 

Expression of g p!20 in stably transfected CHO cells. 

Autoradiograph of 35 S- labeled supernatants from stable CHO 
cell lines, immunoprecipitated with a CD4- immunoglobulin - 
3 0 Protein A-Sepharose complex, and run on a reducing 10% SDS- 
PAGE gel. Lane 1: clone 9; lane 2: clone 13; lane 3: clone 
6; lane 4: Clone 5. Positions of molecular weight markers 
are indicated. 
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Figure 7 

tPA-gpl20 J Tl FL , The nucleotide and deduced amino acid sequence 
of the tPA signal sequence fused to HIV-l JR . FL gpl20 is shown. 
The Narl and NotI restriction endonuclease sites used for 
5 cloning are shown in bold. The predicted site of cleavage by- 
signal peptidase between Arg 35 and Val 36 is indicated. 

Figure 8 

tPA-gpl2 0 LM -V3 c ' ) . The nucleotide and deduced amino acid 
10 sequence of the tPA signal sequence fused to HIV-l^ gpl20 
with the V3 loop deleted and replaced with the pentapeptide 
TGAGH is shown. The V3 loop replacement and the Narl and 
NotI restriction endonuclease sites used for cloning are 
shown in bold. The predicted site of cleavage by signal 
15 peptidase between Arg 35 and Thr 36 is indicated. 

Figure 9 

tPA-gpl20 mFI -V3 () . The nucleotide and deduced amino acid 
sequence of the tPA signal sequence fused to HIV-Ijr.fl gpl20 
20 with the V3 loop deleted and replaced with the pentapeptide 
TGAGH is shown. The V3 loop replacement and the Narl and 
NotI restriction endonuclease sites used for cloning are 
shown in bold. The predicted site of cleavage by signal 
peptidase between Arg 35 and Val 36 is indicated. 

25 

Figure 10 

tPA-gpl20 TM -V3 ( ' ) -CD4 ( ' ) . Shown is the nucleotide and deduced 
amino acid sequence of the tPA signal sequence fused to HIV- 
Iiai gp!20, with the V3 loop deleted and replaced with the 
3 0 pentapeptide TGAGH, and Trp^ mutated to Val. The mutations 
and the Narl and NotI restriction endonuclease sites used 
for cloning are shown in bold. The predicted site of 
cleavage by signal peptidase between Arg 35 and Thr 36 is 
indicated. 
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Figure 11 

t. PA - op 1 2 0 F P F L - V3 c-) - CD 4 H . Shown is the nucleotide and deduced 
amino acid sequence of the tPA signal sequence fused to HIV- 
1jr-fl gpl20, with the V3 loop deleted and replaced with the 
5 pentapeptide TGAGH , and Trp 396 mutated to Val . The mutations 
and the Narl and NotI restriction endonuclease sites used 
for cloning are shown in bold. The predicted site of 
cleavage by signal peptidase between Arg 35 and Val 36 is 
indicated. 

10 

Figure 12 

tPA-gpl20 LA I -CD4 ( ' ) . Shown is the nucleotide and deduced amino 
acid sequence of the tPA signal sequence fused to HIV-l^ 
gpl2 0. The Trp 437 to Val CD4 binding mutation, the Narl and 
15 NotI restriction endonuclease sites used for cloning, and 
the predicted site of cleavage by signal peptidase between 
Arg 35 and Thr 36 are shown in bold. 

Figure 13 

2 0 t PA^ gp!2 0^ ni ~ CD4 ( ' } . Shown is the nucleotide and deduced amino 
acid sequence of the tPA signal sequence fused to HIV-i^fl 
gpl20. The Trp 424 to Val CD4 binding mutation, the Narl and 
NotI restriction endonuclease sites used for cloning and the 
predicted cleavage by signal peptidase between Arg 35 and Val 36 

25 are shown in bold. 

Figure 14 

Expression of opl20 in stably transfected CHO cells . 
Autoradiograph of super 33 S- labeled supernatants from stable 
30 CHO cell lines, immunoprecipitated with MoAb F105 - Protein A- 
Sepharose complex, and run on a reducing 10% SDS-PAGE gel. 
Panel A: Lane 1: tPA-gp!20 LAI CHO cells; lane 2: tPA-gp^O^- 
V3 W CHO cells; lane 3: tPA-gpl20 LAI -V3 ( - ) - CD4 (_) CHO cells. Panel 
B: Lane 1: tPA-gpl20xR.pL CHO cells; lane 2: tPA-op^Om-FL-V^ 
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CHO cells; lane 3: tPA-gpl20 JR . FL -V3 ( ; > -CD4 ( - ) CHO cells. 
Positions of molecular weight markers are indicated. 

Figure 15 
5 Purified gpl20 proteins. 

Silver stained 10% SDS-PAGE gel with a sample of purified 
gpl20 proteins. Panel A: Lane 1: tPA-gpl2 0 LAI CHO cells; lane 
2: tPA-gpa^Oxju-VS" CHO cells; lane 3: tPA-gpl20 LAI -V3 ( - ) -CD4 ( - ) 
CHO cells. Panel B: Lane 1: tPA-gpl20 JR _ FL CHO cells; lane 2: 
10 tPA-gpl20, R _ FL -V3 ( -> CHO cells; lane 3: tPA-gpl20 JR . FL -V3 ( - ) -CD4 t " ) CHO 
cells. Positions of molecular weight markers are indicated. 

Figure 16 

Analysis of binding of recombinant mutant gpl 20 to cell 

15 surface human CD4 bv FACS . 

Plate l. DG44 cells, a subclone of CHO cells which lack 
expression of the human CD4 protein, were used as control. 
Increasing concentrations of HIV-1 gp^Ouu did not show an 
increase in specific fluoresence when compared to 

20 background. Plate 2. DG44 #3 cells are a CHO cell line 
transfected with the cDNA clone encoding the human CD4 
protein. Increasing concentrations of HIV-1 gpl20 LAJ show a 
dramatic increase (or shift) in fluoresence. Plate 3. 
Similar to Plate 2 but the HIV-1 gpl20 LAI -V3 ( " ) protein was 

25 added. Again a large shift indicating binding to the DG44 
#3 cells was seen. Plate 4. DG44 #3 cells were incubated 
with either HIV-1 gpl20 LAI -V3 ( - ) -CD4<-> protein or MoAb OKT4A an 
antibody with high affinity for human CD4. Only 0KT4A bound 
to the cells. 
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The plasmids designated PPI4-tPA-gpl20 LAI and PPI4- tPA-gpl20 JR _ 
FL were deposited pursuant to, and in satisfaction of, the 
5 requirements of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure with the American Type Culture 
Collection (ATCC) , 12301 Parklawn Drive , Rockville, Maryland 
20852 under ATCC Accession Nos. 75431 and 75432, 
10 respectively. The plasmids PPI4 - tPA-gpl20 LA i and PPI4-tPA- 
gpl20 JR . FL were deposited with the ATCC on March 12, 1993. 

The subject invention provides a recombinant nucleic acid 
molecule which encodes a mutant HIV-l gpi20 envelope 
15 glycoprotein comprising a V3 loop deletion and a C4 domain^ 
>X) point mutation, wherein X is an amino acid residue other 
than tryptophan. In the preferred embodiment, X is a valine 
residue. 

2 0 In one embodiment, the nucleic acid molecule is a DNA 
molecule. The DNA molecule may be a plasmid. In one 
embodiment, the plasmid comprises the sequence of the 
plasmid designated PPI4-tPA. 

25 The V3 loop of HIV-l gpl20 envelope glycoprotein is shown in 
Figure 1. The V3 loop is demarcated by cysteine residues at 
both its N- and C- termini. As used herein, a V3 loop 
deletion means a deletion of one or more amino acid residues 
between the terminal cysteine residues, with the proviso 

30 that there must be three or more amino acid residues 
situated between the two terminal cysteine residues in a V3 
loop deletion. These three or more amino acid residues may 
either be residues originally present in the V3 loop, or 
exogenous residues. For example, as shown in the 
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Experimental Details section infra , the pentapeptide TGAGH 
is situated between the two terminal cysteine residues. 
Variations in the size of the V3 loop deletion illustrated 
herein are tolerable without affecting the overall structure 
5 of the mutant HIV-1 gpl20 envelope glycoprotein, as is .well 
known to those skilled in the art. 

As used herein, "C4 domain" means the HIV-1 gpl20 envelope 
glycoprotein C4 domain having the following consensus 
10 sequence: 

X] X 2 X 3 C X 4 I X5X 6 X7X g X9Xi 0 WX 11 X 1 2X 1 3Xi4Xj5AX 16 YX] 7 X 1 g - 

PXJ9X2QX21X22X23X24X25X26SX27X28TGX29X30X31X32RX33GX34, 

15 wherein X 1 ~ T, I, V, K or R; X 2 = L, I or H; X 3 ■= P, Q, L or 
T; X4 = R, K or G; X 5 = K or E; X* « Q or E; X 7 = F, I or V; 
X 8 « I, V or M; X 9 = N, R or K; X 10 - M, R, L or T; X u = Q, R 
or V; X t2 = E, K, G, R # V or A; X l3 = V, T, A or G; X, 4 = G or 
E; X 15 = K, R, E, or Q; X l6 « M, V, I or L; X 17 « A, T or D; X 18 

20 = P or L; X 19 - I or F; X 20 = S, R, G, K, N, A, E or Q; X 21 = 
G or R; X 22 = Q, L, P, N, K, V, T, E or I; X23 = I, V or L; X w 
= R, K, S, N, G, I, T, E or I; X25 = C or R; X 26 = S, L, I, T, 
P, E, V, K, D or N; = N, K or L; Xjg - I or V; X 29 = L, P 

or I; X 30 = L or I; X 31 = L or I; X 32 = T , A, I, V or E; X 33 = 

25 D or E; X^ = G or V. 

The C4 domain consensus sequence is based on existing C4 
domain sequence information from various HIV-l strains, and 
thus is not necessarily an exhaustive consensus sequence. 
3 0 The conserved tryptophan residue shown in bold after residue 
Xio is the only conserved tryptophan residue in the C4 
domain. As used herein, a C4 domain (W _ >X) point mutation is 
a mutation of the above- identified conserved C4 domain 
tryptophan residue to an amino acid residue other than 



WO 94/22477 

tryptophan. For example, a 
a imitation of the conserved 
a valine residue. 
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C4 domain^..^ point mutation is 
C4 domain tryptophan residue to 



5 In one embodiment, the C4 domain is an HIV-l^ gpl20 
envelope glycoprotein C4 domain. The sequence of the HIV- 
Ilaj gpl2 0 C4 domain is: TLPCRIKQFINMWQEVGKAMYAPPISGQIRCS - 
SNITGLLLTRDGG . The mutant HIV-1 gpl20 envelope glycoprotein 
may be a mutant HIV-Ilai gpl20 envelope glycoprotein. 

10 

In another embodiment, the C4 domain is an HIV-1jr.pl gpl20 
envelope glycoprotein C4 domain. The sequence of the HIV-1 JR . 
fl gpl20 C4 domain is: 'tLPCRIKQIINMWQEVGKAMYAPPIRGQIRCS- 
SNITGLLLTRDGG * The mutant HIV-1 gpl20 envelope glycoprotein 
15 may be a mutant HIV-Ijr.fl gpl20 envelope glycoprotein. 

HIV-l^j is a laboratory -adapted strain that is tropic for 
phyt ©hemagglutinin (PHA) - stimulated peripheral blood 
lymphocytes (PBLs) and immortalized human T-cell lines. In 

2 0 contrast, HIV-I^pl was isolated from brain tissue taken at 

autopsy that was co- cultured with lectin-activated normal 
human PBLs. HIV-1jr.pl is tropic for PHA- stimulated PBLs and 
blood-derived macrophages but will not replicate in 
transformed T-cell lines. Mutant HIV-1 gpl20 envelope 
25 glycoproteins derived from a clinical isolate of HIV-l such 
as JR-FL may possess new or different epitopes compared to 
the laboratory- adapted HIV-1 strains that are beneficial for 
successful vaccination. Although only the HIV-l^ and HIV- 
1jr.pl strains are used herein to generate the mutant HIV-1 

3 0 gpl20 envelope glycoproteins of the subject invention, other 

HIV-1 strain could be substituted in their place as is well 
known to those skilled in the art. 



The VI and V2 variable regions of gpl2 0 are unnecessary for 
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CD4 binding (21) . Therefore the mutant HIV-l gpl2 0 envelope 
glycoprotein of this invention can either include or exclude 
the VI and V2 variable regions. 

5 The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-l gpi20 
envelope glycoprotein comprising a V3 loop deletion and a C4 
domain (Asp _ >X) point mutation, wherein the aspartate residue is 
between amino acid residues X l5 and X l6 in the C4 consensus 
10 sequence, and X is an amino acid residue other than 
aspartate or glutarnate. In the preferred embodiment, X is 
an alanine residue. 



The subject invention additionally provides a recombinant 
15 nucleic acid molecule which encodes a mutant HIV-l gpl20 
envelope glycoprotein comprising a V3 loop deletion and a C4 
domain (Glu _ >X) point mutation, wherein the glutarnate residue is 
between amino acid residues X 15 and X l6 in the C4 consensus 
sequence, and X is an amino acid residue other than 
20 aspartate or glutarnate. In the preferred embodiment, X is 
an alanine residue. 



The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-l^ gpl20 
25 envelope glycoprotein comprising a V3 loop deletion and a C3 
domain (Mp 3 7g->X) point mutation, wherein X is an amino acid 
residue other than aspartate or glutarnate. In the preferred 
embodiment, X is a lysine residue. 

30 The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-1jr.pl gpl20 
envelope glycoprotein comprising a V3 loop deletion and a C3 
domain (Iutp3e9 _ >X) point mutation, wherein X is an amino acid 
residue other than aspartate or glutarnate. In the preferred 
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embodiment, X is a lysine residue. 

The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-l^, gpl20 
5 envelope glycoprotein comprising a V3 loop deletion and: a C3 
domain tghl380 .. >X) point mutation, wherein X is an amino acid 
residue other than glutamate. In the preferred embodiment, 
X is a glutamine residue. 

10 The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-1jr.pl gpl20 
envelope glycoprotein comprising a V3 loop deletion and a C3 
domain (g | U 37 1 _ >X) point mutation, wherein X is an amino acid 
residue other than glutamate. In the preferred embodiment, 

15 X is a glutamine residue. 

The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-1^ gpi20 
envelope glycoprotein comprising a V3 loop deletion and a C2 
2 0 domain (thr2 67->x) point mutation, wherein X is an amino acid 
residue other than threonine. In the preferred embodiment, 
X is an arginine residue. 

The subject invention additionally provides a recombinant 
25 nucleic acid molecule which encodes a mutant HIV-l JR . FL gpl20 
envelope glycoprotein comprising a V3 loop deletion and a C2 
domain (lhr 26o->x) point mutation, wherein X is an amino acid 
residue other than threonine. In the preferred embodiment, 
X is an arginine residue. 

30 

The subject invention additionally provides a recombinant 
nucleic acid molecule which encodes a mutant HIV-i gpi20 
envelope glycoprotein comprising (a) a V3 loop deletion, or 
(b) a one of the C2, C3 or C4 domain point mutations 
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discussed supra . 

The point mutations in the recombinant nucleic acid 
molecules described supra are selected based on their 
5 ability to reduce the affinity of the mutant gpl2 0 
glycoprotein encoded thereby for CD4 . As used herein, the 
term "reduce the affinity" means to reduce the affinity by 
at least two- fold. 

10 One skilled in the art would know how to make recombinant 
nucleic acid molecules which encode mutant HIV-1 gpl20 
envelope glycoproteins conqorising a V3 loop deletion and the 
specific C2, C3 or C4 domain point mutations corresponding 
to those mutations exemplified in the HIV-1jr.pl and HIV-l^, 

15 strains, supra . Furthermore, one skilled in the art would 
know how to use these recombinant nucleic acid molecules to 
obtain the proteins encoded thereby, and practice the 
therapeutic and prophylactic methods of using same, as 
described herein for the recombinant nucleic acid molecule 

20 which encodes a mutant HIV-l gpl20 envelope glycoprotein 
comprising a V3 loop deletion and a C4 domain (W „ >X) point 
mutation. 

The subject invention also provides the mutant HIV-1 gpl2 0 
25 envelope glycoprotein encoded by the recombinant nucleic 
acid molecule of the subject invention. 

In accordance with the invention, numerous vector systems 
for expression of the mutant HIV-l gpl20 envelope 
30 glycoprotein may be employed. For example, one class of 
vectors utilizes DNA elements which are derived from animal 
viruses such as bovine papilloma virus, polyoma virus, 
adenovirus, vaccinia virus, baculovirus, retroviruses (RSV, 
MMTV or MoMLV) , Semliki Forest virus or SV40 virus. 
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Additionally, cells which have stably integrated the DNA 
into their chromosomes may be selected by introducing one or 
more markers which allow for the selection of transfected 
host cells. The marker may provide, for example, prototropy 
5 to an auxotrophic host, biocide resistance, (-e.g., 
antibiotics) or resistance to heavy metals such as copper or 
the like. The selectable marker gene can be either directly 
linked to the DNA sequences to be expressed, or introduced 
into the same cell by cotransf ormation. Additional elements 
10 may also be needed for optimal synthesis of mRNA. These 
elements may include splice signals, as well as 
transcriptional promoters, enhancers, and termination 
signals. The cDNA expression vectors incorporating such 
elements include those described by Okayama (22) . 

15 

The vectors used in the subject invention are designed to 
express high levels of mutant HIV-1 gpl20 envelope 
glycoproteins in cultured eukaryotic cells as well as 
efficiently secrete these proteins into the culture medium. 
20 The targeting of the mutant HIV-1 gpl20 envelope 
glycoproteins into the culture medium is accomplished by 
fusing in- frame to the mature N- terminus of the mutant HIV-1 
gpl20 envelope glycoprotein the tissue plasminogen activator 
(tPA) prepro- signal sequence. 

25 

The mutant HIV-1 gpl20 envelope glycoprotein may be produced 
by a) transfecting a mammalian cell with an expression 
vector for producing mutant HIV-1 gpl20 envelope 
glycoprotein; b) culturing the resulting transfected 
30 mammalian cell under conditions such that mutant HIV-l gpl20 
envelope glycoprotein is produced; and c) recovering the 
mutant HIV-l gpl20 envelope glycoprotein so produced. 

Once the expression vector or DNA sequence containing the 
3 5 constructs has been prepared for expression, the expression 
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vectors may be transfected or introduced into an appropriate 
mammalian cell host. Various techniques may be employed to 
achieve this, such as, for example, protoplast fusion, 
calcium phosphate precipitation, electroporation or other 
5 conventional techniques. In the case of protoplast fusion, 
the cells are grown in media and screened for the 
appropriate activity. Expression of the gene encoding a 
mutant HIV-1 gpl20 envelope glycoprotein results in 
production of the mutant glycoprotein. 

10 

Methods and conditions for culturing the resulting 
transfected cells and for recovering the mutant HIV-l gpl20 
envelope glycoprotein so produced are well known to those 
skilled in the art, and may be varied or optimized depending 
15 upon the specific expression vector and mammalian host cell 
employed. 



In accordance with the claimed invention, the preferred host 
cells for expressing the mutant HIV-1 gpl20 envelope 

20 glycoprotein of this invention are mammalian cell lines. 
Mammalian cell lines include, for example, monkey kidney CV1 
line transformed by SV40 (COS-7) ; human embryonic kidney 
line 293? baby hamster kidney cells (BHK) ; Chinese hamster 
ovary -cells- DHFR (CHO) ; Chinese hamster ovary-cells DHFR" 

25 (DXB11) ; monkey kidney cells (CV1) ; African green monkey 
kidney cells (VERO-76) ; human cervical carcinoma cells 
(HELA) ; canine kidney cells (MDCK) ; human lung cells (W13 8) ; 
human liver cells (Hep G2) ; mouse mammary tumor (MMT 
060562); mouse cell line (C127) ; and myeloma cell lines. 

30 

Other eukaryotic expression systems utilizing non-mammalian 
vector/ cell line combinations can be used to produce the 
mutant HIV-1 gpl20 envelope glycoproteins- These include, 
but are not limited to, baculovirus vector/insect cell 
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expression systems and yeast shuttle vector/yeast cell 
expression systems. 

Methods and conditions for purifying mutant HIV-1 gpl20 
5 envelope glycoproteins from the culture media are provided 
in the invention, but it should be recognized that these 
procedures can be varied or optimized as is well known to 
those skilled in the art. 



10 The subject invention further provides a vaccine which 
comprises a therapeutically effective amount of the mutant 
HIV-1 gpl20 envelope glycoprotein of the subject invention, 
and an ad j uvant . 

15 A therapeutically effective amount of the mutant HIV-1 gpl20 
envelope glycoprotein may be determined according to methods 
well known to those skilled in the art. 



As used herein, adjuvants include, but are not limited to, 
20 alum, Freund's incomplete adjuvant (FIA) , Saponin, Quil A, 
Monophosphoryl lipid A (MPL) , and nonionic block copolymers 
(SAF) such as L-121 (Pluronic; Syntex SAF) . In the preferred 
embodiment, the adjuvant is alum, especially in the form of 
a thixotropic, viscous, and homogeneous aluminum hydroxide 
25 gel- The vaccine of the subject invention may be 
administered as an oil in water emulsion. Methods of 
combining adjuvants with antigens are well known to those 
skilled in the art. 

3 0 The subject invention further provides a method of treating 
an HIV-1 -infected subject, which comprises immunizing the 
HIV-1 -infected subject with the vaccine of the subject 
invention, thereby treating the HIV- 1- infected subject. 

35 As used herein, treating an HIV- 1- infected subject with the 
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vaccine of the subject invention means reducing in the 
subject either the population of HIV-l or HIV- 1 - infected 
cells, or ameliorating the progression of an HIV-l -related 
disorder in the subject. 

5 

As used herein, an "HIV-infected subject" means an 
individual having at least one of his own cells invaded by 
HIV-l. 

10 As used herein, "immunizing" means administering a primary 
dose of the vaccine to a subject, followed after a suitable 
period of time by one or more subsequent administrations of 
the vaccine, so as to generate in the subject an immune 
response against the CD4 -binding region of the mutant HIV-l 

15 gpi20 envelope glycoprotein in the vaccine. A suitable 
period of time between administrations of the vaccine may 
readily be determined by one skilled in the art, and is 
usually in the order of several weeks to months. 

20 In the preferred embodiment, the dose of vaccine 
administered is an amount sufficient to deliver to the 
subject between lOug and Img of the mutant HIV-l gpl20 
envelope glycoprotein. 

25 The subject invention further provides a vaccine which 
comprises a prophylactically effective amount of the mutant 
HIV-l gpl20 envelope glycoprotein of the subject invention, 
and an adjuvant, 

30 A prophylactically effective amount of the mutant HIV-l 
gpl20 envelope glycoprotein may be determined according to 
methods well known to those skilled in the art. 



35 



The subject invention further provides a method of reducing 
the likelihood of an HIV- 1 - exposed subject's becoming 
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infected with HIV-1, which comprises immunizing the HIV-l- 
exposed subject with the vaccine of the subject invention, 
thereby reducing the likelihood of the HIV- l-exposed 
subject's becoming infected with HIV-1. 

5 

As used herein, the subject's becoming infected with HIV-l 
means the invasion of the subject's own cells by HIV-1. 

As used herein, reducing the likelihood of a subject's 
10 becoming infected with HIV-1 means reducing the likelihood 
of the subject's becoming infected with HIV-1 by at least 
two -fold. For example, if a subject has a 1% chance of 
becoming infected with HIV-1, a two- fold reduction in the 
likelihood of the subject's becoming infected with HIV-l 
15 would result in the subject's having a 0.5% chance of 
becoming infected with HIV-1. In the preferred embodiment of 
this invention, reducing the likelihood of the subject's 
becoming infected with HIV-1 means reducing the likelihood 
of the subject's becoming infected with HIV-l by at least 
20 ten-fold. 

As used herein, an HIV-l -exposed subject is a subject who 
has HIV-l present in his body, but has not yet become HIV-1- 
infected. 

25 

The subject invention further provides a method of reducing 
the likelihood of a non-HIV-1 -exposed subject's becoming 
infected with HIV-1, which comprises immunizing the non-HIV- 
1- exposed subject with the vaccine of the subject invention, 
3 0 thereby reducing the likelihood of the non-HIV-1 -exposed 
subject's becoming infected with HIV-1. 

As used herein, a non-HIV-1 -exposed subject is a subject who 
does not have HIV-1 present in his body. 

35 
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The subject invention further provides a method of obtaining 
partially purified antibodies which specifically bind to the 
CD4- binding domain of HIV-1 gpl2 0 envelope glycoprotein, 
which method comprises (a) immunizing a non- HIV- 1- exposed 
5 subject with the vaccine of the subject invention, (b) 
recovering from the immunized subject serum comprising said 
antibodies, and (c) partially purifying said antibodies, 
thereby obtaining partially purified antibodies which 
specifically bind to the CD4 -binding domain of HIV-1 gpl20 
10 envelope glycoprotein. In the preferred embodiment, the 
subject is a human. 

As used herein, partially purified antibodies means a 
composition which comprises antibodies which specifically 

15 bind to the CD4 -binding domain of HIV-1 gpl20 envelope 
glycoprotein, and consists of fewer protein impurities than 
does the serum from which the ant i-CD4- binding domain 
antibodies are derived. A protein impurity means a protein 
other than the anti-CD4 -binding domain antibodies. For 

20 example, the partially purified antibodies might be an IgG 
preparation. 

Methods of recovering serum from a subject are well known to 
those skilled in the art. Methods of partially purifying 
25 antibodies are also well known to those skilled in the art, 
and include, by way of example, filtration, ion exchange 
chromatography, and precipitation. 

In one embodiment, the partially purified antibodies 
30 comprise an immune globulin (IG) preparation. IG can be 
purified from serum by a two-step process. Initially, serum 
is fractionated by the cold ethanol method of Cohn, et al. 
(29). Cohn Fraction II has as its main protein component 
IgG immunoglobulin present as monomers, dimers and 
35 aggregates. Fraction II is then purified to produce IVIG 
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(immune globulin intravenous) using a variety of 
purification methods which include, for examples, ion 
exchange, DEAE chromatography, acid pH 4.25 diaf iltration, 
PEG precipitation or Pepsin treatment. The final product is 
5 stabilized (e.g., glucose ■+ NaCl) and the final- IgG 
concentration is fixed at between about 3% and about 6%. 

The subject invention further provides the partially 
purified antibodies produced by the method of the subject 
10 invention. 

The subject invention further provides a pharmaceutical 
composition, which comprises a therapeutically effective 
amount of the partially purified antibodies of the subject 
15 invention, and a pharmaceutically acceptable carrier. 

A therapeutically effective amount of the partially purified 
antibodies of the subject invention may be determined 
according to methods well known to those skilled in the art. 

20 

Pharmaceutically acceptable carriers are well known to those 
skilled in the art and include, but are not limited to, 
0.01-0.1M and preferably 0.05M phosphate buffer or 0.8% 
saline. Additionally, such pharmaceutically acceptable 

25 carriers may be * aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous 
solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic 
esters such as ethyl oleate. Aqueous carriers include 

30 water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. 
Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated 
Ringer's or fixed oils. Intravenous vehicles include fluid 

35 and nutrient replenishers , electrolyte replenishers such as 
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(immune globulin intravenous) using a variety of 
purification methods which include, for example, ion 
exchange, DEAE chromatography, acid pH 4.25 diaf iltration, 
PEG precipitation or Pepsin treatment. The final product is 
5 stabilized (e.g., glucose + NaCl) and the final- IgG 
concentration is fixed at between about 3% and about 6%. 

The subject invention further provides the partially 
purified antibodies produced by the method of the subject 
10 invention. 

The subject invention further provides a pharmaceutical 
composition, which comprises a therapeutically effective 
amount of the partially purified antibodies of the subject 
15 invention, and a pharmaceutically acceptable carrier. 

A therapeutically effective amount of the partially purified 
antibodies of the subject invention may be determined 
according to methods well known to those skilled in the art. 

20 

Pharmaceutically acceptable carriers are well known to those 
skilled in the art and include, but are not limited to, 
0.01-0.1M and preferably 0.05M phosphate buffer or 0.8% 
saline. Additionally, such pharmaceutically acceptable 

25 carriers may be aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non- aqueous 
solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic 
esters such as ethyl oleate. Aqueous carriers include 

3 0 water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. 
Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated 
Ringer's or fixed oils. Intravenous vehicles include fluid 

35 and nutrient replenishers, electrolyte replenishers such as 
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those based on Ringer's dextrose, and the like. Preserva- 
tives and other additives may also be present, such as, for 
example, antimicrobials, antioxidants, chelating agents, 
inert gases and the like. 

5 

The subject invention further provides a method of treating 
an HIV- 1- infected subject, which comprises administering to 
the subject a dose of the pharmaceutical composition of the 
subject invention ef-fective to reduce the population of HIV- 
10 1- infected cells in the HIV- 1 - infected subject, thereby 
treating the HIV- 1- infected subject. 

As used herein, administering may be effected or performed 
using any of the various methods known to those skilled in 
15 the art. The administering may comprise administering 
intravenously. The administering may also comprise 
administering intramuscularly. The administering may 
further comprise administering subcutaneously. 

2 0 The dose of the pharmaceutical composition of the subject 
invention effective to reduce the population of HIV-l- 
infected cells in the HIV- 1- infected subject may be readily 
determined using methods well known to those skilled in the 
art. In the preferred embodiment, the dose is sufficient to 

25 deliver to the subject between about 10 mg/kg and I50mg/kg 
of protein if administered intramuscularly. In the 
preferred embodiment, the dose is sufficient to deliver to 
the subject between about 100 mg/kg and 2g/kg of protein if 
administered intravenously. 

30 

The subject invention further provides a method of treating 
an HIV- 1- infected subject, which comprises administering to 
the subject a dose of the pharmaceutical composition of the 
subject invention effective to reduce the population of HIV- 
35 1 in the HIV- 1- infected subject, thereby treating the HIV-1- 
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those based on Ringer's dextrose, and the like. Preserva- 
tives and other additives may also be present, such as, for 
example, antimicrobials, antioxidants, chelating agents, 
inert gases and the like. 

5 

The subject invention further provides a method of treating 
an HIV- 1- infected subject, which comprises administering to 
the subject a dose of the pharmaceutical composition of the 
subject invention ef-fective to reduce the population of HIV- 
10 1-infected cells in the HIV- 1- infected subject, thereby 
treating the HIV- 1- infected subject. 

As used herein, administering may be effected or performed 
using any of the various methods known to those skilled in 
15 the art. The administering may comprise administering 
intravenously. The administering may also comprise 
administering intramuscularly. The administering may 
further comprise administering subcutaneously . 

20 The dose of the pharmaceutical composition of the subject 
invention effective to reduce the population of HIV-l- 
infected cells in the HIV- l-infected subject may be readily 
determined using methods well known to those skilled in the 
art. In the preferred embodiment, the dose is sufficient to 

25 deliver to the subject between about 10 mg/kg and 150mg/kg 
of protein if administered intramuscularly. In the 
preferred embodiment, the dose is sufficient to deliver to 
the subject between about 100 mg/kg and 2g/kg of protein if 
administered intravenously. 

30 

The subject invention further provides a method of treating 
an HIV- 1-infected subject, which comprises administering to 
the subject a dose of the pharmaceutical composition of the 
subject invention effective to reduce the population of HIV- 
35 1 in the HIV- 1 - infected subject, thereby treating the Hiv-l- 
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infected subject. 

The dose of the pharmaceutical composition of the subject 
invention effective to reduce the population of HIV-1 in the 
5 HIV- 1- infected subject may be readily determined .using 
methods well known to those skilled in the art. In the 
preferred embodiment, the dose is sufficient to deliver to 
the subject between about 10 mg/kg and 150mg/kg of protein 
if administered intramuscularly. In the preferred 

10 embodiment, the dose is sufficient to deliver to the subject 
between about 100 mg/kg and 2g/kg of protein if administered 
intravenously . 

The subject invention further provides a composition which 
15 comprises a prophylactically effective amount of the 
partially purified antibodies of the subject invention, and 
a pharmaceutically acceptable carrier. 

A prophylactically effective amount of the partially 
2 0 purified antibodies of the subject invention may be 
determined according to methods well known to those skilled 
in the art. 

The subject invention further provides a method of reducing 
25 the likelihood of an HIV-l-expqsed subject's becoming 
infected with HIV- 1 , which comprises administering to the 
HIV-1 -exposed subject a dose of the composition of the 
subject invention effective to reduce the population of HIV- 
1 in the HIV-1 -exposed subject, thereby reducing the 
30 likelihood of the subject's becoming infected with HIV-l. 

In one embodiment, the subject is a medical practitioner. 
The medical practitioner may be . a medical practitioner 
exposed to an HIV- 1- containing bodily fluid. As used herein, 
35 the term "medical practitioner" includes, but is in no way 
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limited to, doctors, dentists, surgeons, nurses, medical 
laboratory assistants, and students in health care programs. 

In another embodiment, the subject is a newborn infant. The 
5 newborn infant may be a newborn infant born to an HIV-l- 
infected mother. 

The dose of the composition of the subject invention 
effective to reduce the population of HIV-l in the HIV-1- 

10 exposed subject may be readily determined using methods well 
known to those skilled in the art. In the preferred 
embodiment, the dose is sufficient to deliver to the subject 
between about lOmg/kg and 150mg/kg of protein if 
administered intramuscularly. In the preferred embodiment, 

15 the dose is sufficient to deliver to the subject between 
about 100 mg/kg and 2g/kg of protein if administered 
intravenously . 

The vaccines and pharmaceutical compositions of the subject 
20 invention may also ameliorate the progression of an HIV-l- 
related disorder in a subject to whom the vaccines or 
pharmaceutical compositions were administered while the 
subject was either non- HIV- 1- exposed or HIV- 1- exposed, but 
not yet HIV-l -infected. 

25 

Finally, the subject invention provides a method of reducing 
the likelihood of a non- HIV-l -exposed subject's becoming 
infected with HIV-l as a result of exposure thereto during 
an incident wherein there is an increased risk of exposure 

3 0 to HIV-l, which comprises administering to the subject 
immediately prior to the incident a dose of the composition 
of the subject invention effective to reduce the population 
of HIV-l to which the subject is exposed during the 
incident, thereby reducing the likelihood of the subject's 

3 5 becoming infected with HIV-l. In one embodiment, the 



WO 94/22477 PCT/US94/032&2 

33 

subject is a medical practitioner. 

An incident wherein there is an increased risk of exposure 
to HIV-1 includes, for example, receiving a blood 
5 transfusion, sexual contact with an HIV- 1 - infected 
individual, and performing a HIV-1 -containing bodily fluid- 
exposing medical procedure. 

As used herein, "immediately prior to the incident" means 
10 within one month of the incident. In the preferred 
embodiment, "immediately prior to the incident" means within 
one day of the incident. 

The dose of the composition of the subject invention 
15 effective to reduce the population of HIV-1 to which the 
subject is exposed during the incident may be readily 
determined using methods well known to those skilled in the 
art. In the preferred embodiment, the dose is sufficient to 
deliver to the subject between about lOmg/kg and 150mg/kg of 

2 0 protein if administered intramuscularly. In the preferred 

embodiment, the dose is sufficient to deliver to the subject 
between about lOOmg/kg and 2g/kg of protein if administered 
intravenously. 

25 One embodiment of this invention is a method of 
substantially reducing the likelihood of a non- infected 
medical practitioner's becoming infected with HIV-1 during 
a bodily fluid- exposing medical procedure involving a 
patient, which comprises administering to the patient during 

3 0 a suitable time period an amount of the composition of the 

subject invention effective to substantially reduce the 
likelihood of the non-infected medical practitioner's 
becoming infected with HIV-1 by virtue of contact with the 
patient's bodily fluid during the medical procedure. 

35 
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As used herein, a bodily fluid is any fluid which is present 
in the human body and is capable of containing infectious 
HIV-l in an HIV- 1- infected patient. Bodily fluids include, 
but are not limited to, saliva, cerebrospinal fluid, tears, 
5 vaginal secretions, urine, alveolar fluid, synovial fluid 
and pleural fluid* 

Another embodiment of this invention is a method of 
substantially reducing the likelihood of a non-HTV-l- 

10 infected newborn infant's becoming infected with HIV-l prior 
to or during birth from an HIV-l -infected mother, which 
comprises administering to the mother prior to birth an 
amount of the composition of the subject invention effective 
to substantially reduce the likelihood of the non-HIV-1- 

15 infected newborn infant's becoming infected with HIV-l by 
virtue of contact with the patient's bodily fluid. 

In order to facilitate an understanding of the Experimental 
Details section which follows, certain frequently occurring 
20 methods and/or terms are best described in Maniatis et al. 

(23). 

This invention will be better understood by reference to the 
Experimental Details which follow, but those skilled in the 
25 art will readily appreciate that the specific experiments 
detailed are only illustrative of the invention as described 
more fully in the claims which follow thereafter. 
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Experimental Details 
Nomen c 1 a ture 

As used herein, V3 H indicates a V3 loop deletion from HIV-l 
5 gpl20 envelope glycoprotein. As used herein, CD4 (-> indicates 
a point mutation in the C4 domain of HIV-l gpl20 envelope 
glycoprotein which mutation inhibits CD4 binding to the 
mutant HIV-l gpl20 envelope glycoprotein. The structure of 
HIV-l gpl20 envelope glycoprotein is shown in Figure 1. 
10 Materials and Methods 

1. Construction of PPI4- tPA-gpl20 T M expression vector. 
An expression vector was constructed that consisted of the 
cytomegalovirus major immediate early (CMV MIE) 

15 promoter/enhancer linked to the HlV-l^gny gene, which gene 
had its signal sequence replaced by the tPA signal sequence. 
The CMV MIE promoter/enhancer sequences were derived from 
pSVCCl (24) consisting of 1580 base pairs of contiguous DNA 
that is immediately 5' to the initiator ATG. In sequential 

20 order, the functional domains of the CMV promoter are: the 
promoter/enhancer region; a transcriptional initiator site; 
exon A (a non- coding exon) ; intron A; and 17 nucleotides of 
exon B (non- coding sequences) . The viral promoter sequences 
were ligated to a gene construct consisting of the 

25 nucleotide sequences encoding amino acids -35 to -1 of human 
tPA (25) fused in- frame to HIV - l UI env amino acids 31 through 
515, ending with a TGA stop codon. The construction was 
performed in two parts. The majority of the CMV promoter 
could be isolated as a 1560 bp Hinc II/Pst I fragment which 

30 was ligated to a Pst I/Not I 1590 bp DNA fragment that 
contained the remainder of the CMV promoter, the initiator 
ATG, the tPA signal sequence and the mature HIV-1^ env 
protein coding sequence. 
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The latter fragment was assembled using the polymerase chain 
reaction as follows. Primer 1 (GATCCTGCAGTCACCGTCCTTGACA- 
CGATGGATGCAATGAAGAGA) and primer 2 (AAGTCTTCTCCTCGGTCTTGT- 
CTTTTTAACACCCAG) were used to amplify the nucleic acid 
5 sequences encoding the tPA signal sequence amino acids -35 
to -l from plasmid pMAM neo-s (Clonetech) , thus producing a 
150 bp fragment- A second 144 0 bp DNA fragment was amplified 
using primer 3 ( TTGAGAAGAGGAGCCAGAACAGAAAAATTGTGGGTC ) , 
primer 4 (GGAAAAAAGCGGCCGCTCATTTTTCTCTCTGCACCACTC) , and pENV 

10 (26) as a template. The PCR fragments were pooled, 
desalted, and excess primer removed by ultrafiltration 
through a centricon-100 unit (Amicon) . An aliquot of the 
pooled material was then subjected to a second round of 
amplification in the presence of primers 1 and 4 to produce 

15 a 1590 bp fragment, which was then digested with Pst I and 
Not I. The CMV promoter fragment and the HIV-l^ env 
fragment were then ligated together, and the entire 
transcription unit subcloned into PPI4, which is a 
eukaryotic shuttle vector that contains an ampicillin 

20 resistance gene, an SV40 origin of replication and a DHFR 
gene whose transcription is driven by the S-globin promoter. 
The final construct, PPI4- tPA-gpl20 LAI , is shown in Figure 2. 

The expression vector is then used as the prototype vector 
25 for the expression of gpl20 proteins that are derived from 
other HIV-1 strains or mutated as described in the methods 
section. The vector was constructed so that unique Nar I 
and Not I sites flank the gpl20 sequence, thus facilitating 
the removal of the gpl2 0 gene cassette and the subsequent 
3 0 insertion of other gene cassettes (Figure 2) . 



2. Expression of HIV- 1^ gp!20 in mammalian cells , 
a. Trans ient express ion . 

CosMS cells grown in DMEM containing 10% fetal calf serum 
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were split to 75% confluence. On the following day, the 
cells were transfected for 16-20 hours with 10 micrograms of 
CsCl -purified PPI4- tPA-gpl20 LAI DNA by the standard CaP0 4 (5) 
precipitation technique. After transf ection, fresh medium 
5 was added to the cells. Analysis of the products 
synthesized 96-120 hours post- transf ection was performed by 
radiolabelling the transf ectants with 35 S- cysteine for i2-18 
hours, followed by precipitation of media using a CD4- 
immunoglobul in -Protein A-Sepharose complex, followed by SDS- 
10 PAGE under reducing conditions (Figure 4) . 



b. Stable expression . 

Dhfr - Chinese hamster ovary cells (CHO) were transfected 
with 20 micrograms of CsCl-purif ied DNA. Approximately 3-5 

15 days post- transf ection, cells were placed in selective 
medium (nucleoside- free alpha MEM containing 10% dialyzed 
fetal calf serum) . Approximately 10-15 days post -selection, 
individual cell clones were picked. Media was analyzed for 
gpl2 0 expression by radiolabelling the cells with 35 S- 

20 cysteine for 12-18 hours, followed by precipitation of media 
using a CD4- immunoglobulin- Protein A-Sepharose complex, 
followed in turn by SDS-PAGE under reducing conditions 
(Figure 6) . The levels of gpl20 in the media of these 
clones were also quantitated (Figure 5) by ELISA performed 

25 as follows. The method involves coating 96 -well plates 
overnight with sheep polyclonal IgG against the highly 
conserved C-terminus of gpl20 (D7234, Aalto Bioreagents) . 
After washing, dilutions of a standard gpl20 preparation in 
cell growth medium, or supernatant from the stably- 

30 transfected cells, were incubated for 1 hour. The plates 
were washed again, and incubated for one hour with a 
horseradish peroxidase- conjugated anti-gpl20 monoclonal 
antibody (9204, DuPont) . Following a final wash, the 
peroxidase substrate OPD (DuPont) was added and the amount 
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of gpl2 0 determined by comparing absorbance of unknowns with 
a standard curve. Standards were prepared from purified 
gpl20 made in CHO cells, a small quantity of which was 
obtained from Celltech Ltd- Clones expressing the highest 
5 levels were subjected to successive rounds of amplification 
of the newly introduced DNA sequences in increasing 
concentrations of methotrexate. Stable CHO cell lines were 
thus generated which secrete at least 1 microgram/milliliter 
of HIV-Ila, gpl20. 

10 

3 . Construction of PPI4 - tPA-opl20 F ff ^ 

a. The HIV-l^j gpl20 env nucleotide sequence in PPI4-tPA- 
gpl20 LA1 was replaced by the nucleotide sequence encoding the 
mature gpl2 0^.^ protein* Using the polymerase chain 

15 reaction, the JR-FL sequences were amplified from pUC112-l 
(27) using primer 5 (GATCGGCGCCAGAGTAGAAAAGTTGTGGGTCAC) and 
primer 4. The PCR fragment was digested with the 
restriction endonucleases Nar I and Not I, and the fragment 
subcloned in between the Nar I and Not I sites in PPI4-tPA- 

2 0 gpl20 LAI to generate PPI4- tPA-gpl20jR.pL (Figure 7) . 

b. Trans i ent express i on . 

CosM5 cells grown in DMEM containing 10% fetal calf serum 
were split to 75% confluence. On the following day, the 
25 cells were transfected for 16-20 hours with 10 micrograms of 
CsCl -purified PPI4- tPA-gp^Oj,^ DNA by the standard CaP0 4 (5) 
precipitation technique. After transfection, fresh medium 
was added' to the cells. Analysis of the products 
synthesized 96-120 hours post- transf ection was performed by 

3 0 radiolabelling the transf ectants with 35 S- cysteine for 12-18 

hours, followed by precipitation of media using a CD4- 
immunoglobul in- Protein A- Sepharose complex, followed by SDS- 
PAGE under reducing conditions (Figure 4) . 
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4 . Construction of PPI4 - tPA- crpl2 0 L M -V3 M . 

The V3 loop in tPA-gpl2 0 LAI consists of amino acids Cys 306 
through Cys 333 . In the V3* 0 mutant, the amino acids in 
between these cysteines are replaced by the pentapeptide 
5 sequence Thr-Gly- Ala-Gly-His . Using the Transformer Site- 
Directed Mutagenesis Kit (Clonetech) , the V3 loop sequence 
in PPI4- tPA-gpl2 0 LAI is altered using the mutagenic primer 6 
(CTGTAGAAATTAATTGTACAGGTGCTGGACATTGTAACATTAGTAGAGC) and 
primer 7 (CTCGAGCATGCATTCGAAGCTCGCTGATC) as a selection 

10 primer. Primer 7 changes a unique Xba I site in the 
backbone of the parent PPI4 plasmid into a unique BstB I 
site. Briefly, the mutagenesis method requires incubating 
of the parent plasmid with the mutagenic primer and the 
selection primer, denaturing at 100°C for 3 minutes and then 

15 chilling on ice. In the presence of buffered deoxynucleo- 
tide triphosphates and T4 DNA polymerase, the primers are 
allowed to initiate the polymerization of one strand of 
plasmid DNA. T4 DNA ligase is used to seal the newly 
synthesized DNA strand to form a covalently closed circle. 

2 0 Hybrid plasmids are then transformed into a MutS strain of 
E . coli that is deficient in mismatch repair. After 
allowing for the growth of transformed cells, DNA is 
purified from the cells and digested with the selection 
restriction endonuclease, in this case Xba I. Parental 

25 plasmids -are cleaved by Xba I while the mutant plasmid 
remains resistant to cleavage by virtue of the Xba I to BstB 
I conversion. Digested DNA is then used to transform E . 
coli , and colonies harboring the mutant plasmid are picked. 
Multiple mutagenic primers can be used in a single round of 

30 mutagenesis. The amino acid sequence of the modified 
protein is shown in Figure 8 . 

5. Construction of PPI4- tPA-apl2Qr P _ F L -V3 ( ' ) . 

The V3 loop in tPA-gpl20jR.pL consists of amino acids Cys 293 
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through Cys 327 . In the V3 H mutant, the amino acids in 
between these cysteines are replaced by the pentapeptide 
sequence Thr-Gly-Ala-Gly-His . Using the Transformer Site- 
Directed Mutagenesis Kit (Clonetech) , the V3 loop sequence 
5 in PPI4 - tPA-gpl2 0j R _pL is altered using the mutagenic primer 
6 ( CTGTAGAAATTAATTGTACAGGTGCTGGACATTGTAACATTAGTAGAGC ) and 
primer 7 as a selection primer. The amino acid sequence of 
the modified protein is shown in Figure 9, 

10 G. Construction of PPI4 - tPA- apl20 LAI - CD4 { 'K 

Using the Transformer Site-Directed Mutagenesis Kit 
(Clonetech) , the selection primer 7, and the mutagenic 
primer 8 (CAATTTATAAACATGGTGCAGGAAGTAGG) , Trp 437 of tPA- 
gpl20 LAJ/ which is in an equivalent position to the 

15 tryptophan residue in the HXBc2 strain of HIV-1, is mutated 
to a Val in the expression vector PPI4- tPA-gpl20 LA1 to 
generate PPI4- tPA-gpl20 LAI -CD4 < " ) . The sequence for gpl2 0 LAI - 
CD4 (_) is shown in Figure 12. 

2 0 7. Construction of PPI4- tPA-crpl20 m F L -CD4 ( ' ) . 

In a fashion similar to that described above, Trp 42 4 of tPA- 
gpi20 JR .pL is mutated to a Val in the expression vector PPI4- 
tPA-gpl2 

Ojr-fl using the selection primer 7 and the mutagenic 
primer 9 (CAAATTATAAACATGGTGCAGGAAGTAGG) to generate PPI4- 
25 tPA-gpl20 JR . FL -CD4 ( * ) . The sequence for gpl20 JR _ FL -CD4 ( ' ) is shown 
in Figure 13 . 

8. Construction of PPI4- tPA-crol20, ..-VS^-CD^ . 
The tPA-gpl20 LAI double mutant, V3 ( " ) -CD4 ( * ) , is constructed by 
30 including the mutagenic primers 6 and 8, and the selection 
primer 7 simultaneously in the reaction tube with PPI4-tPA- 
gpl20 lJU as the DNA template. The final construct is named 
PPI4- tPA-gpl20 LAI -V3°-CD4 ( " ) , and its sequence is shown in 
figure 10. 
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9 . Construction of PPI4 - tPA-crpl20 m F L -V3 ( ~ > - CD4 H . 
The tPA-gpl20jR.PL double mutant f V3 ( * ) -CD4 ( * ) , is constructed by 
including the mutagenic primers 6 and 9 , and the selection 
primer 7 simultaneously in the reaction tube with PPI4-tPA- 
5 gpl20 JR . FL as the DNA template. The final construct is named 
PPI4- tPA-gpl20 JR . FL -V3 ( ' ) -CD4 < * ) / and its sequence is shown in 
figure 11. 

10 . Expression of mutant HIV-l crpl20 in mammalian cells . 

a. Trans i en t expr e ssion . 

CosMS cells grown in DMEM containing 10% fetal calf serum 
are split to 75% confluence. On -the next day, the cells are 
transfected for 16-20 hours with 10 micrograms of CsCl- 
purified mutant HIV-l DNA by the standard CaP0 4 (5) 
precipitation technique. After transf ect ion, fresh medium 
is added to the cells. Analysis of the products synthesized 
96-120 hours post- transf ection is performed by 
radiolabellihg the transf ectants with 3S S- cysteine for 12-18 
hours, followed by precipitation of media using a sheep 
polyclonal IgG against the highly conserved C- terminus of 
gpl20. 

b. Stable expression . 

Dhfr" Chinese hamster ovary cells (CHO) are transfected with 
25 20 micrograms of CsCl -purified DNA encoding the native or 
mutant HIV-l gpl20 glycoproteins. Approximately 3-5 days 
post- transf ection, cells are! placed in selective medium 
(nucleoside- free alpha MEM containing 10% dialyzed fetal 
calf serum) . Approximately 10-15 days post -selection, 
3 0 individual cell clones are picked. Media is analyzed for 
gpl2 0 expression by radiolabelling the cells with 35 S- 
cysteine for 12-18 hours, followed by quantitative 
immunoprecipitation of media using a sheep polyclonal IgG 
against the highly conserved C- terminus of gpl20, followed 



15 
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in turn by SDS-PAGE under reducing conditions. 
Alternatively, one can quantitate the level of gpl20 by 
ELISA performed as follows. The method involves coating 96- 
well plates overnight with sheep polyclonal IgG against the 
5 highly conserved C- terminus of gpI20 (D7234, Aalto 
Bioreagents). After washing, dilutions of a standard gpl20 
preparation in cell growth medium, or supernatant from the 
stably- transfected cells, are incubated for 1 hour. The 
plates are washed again, and incubated for one hour with a 

10 human MoAb (F105, AIDS Research & Reference Reagent Program, 
No. 857) . The plates are washed again, and incubated again 
for 1 hour with a horseradish-peroxidase- conjugated goat 
anti-human IgG (Cappel) . Following a final wash, the 
peroxidase substrate OPD (DuPont) is added and the amount of 

15 gpl20 determined by comparing absorbance of unknowns with a 
standard curve. Standards are prepared from purified gpl20 
made in CHO cells, a small quantity of which is obtained 
from Celltech Ltd. Clones expressing the highest levels are 
subjected to successive rounds of amplification of the newly 

2 0 introduced DNA sequences in increasing concentrations of 

methotrexate. Stable CHO cell lines are thus generated 
which secrete at least 1 microgram/milliliter of mutant HIV- 
1 gpl20. 

25 11. Purification of HIV-1 opl20 proteins . 

A one- step immunoaf f inity procedure is used to purify the 
recombinant gpl2 0 molecules described. Briefly, culture 
supernatant is collected and clarified by centrif ugation. 
An immunoaf f inity column consisting of a matrix coupled to 

3 0 a sheep polyclonal anti-gpl20 IgG (D7234, Aalto Bioreagents) 

directed against the highly conserved C- terminal end 
(APTKAKRRWQREKR) of gpl20 is used to specifically adsorb 
gpl20 from the cell culture media. This antisera recognizes 
native gpl20, the V3 loop deletion mutants, and the CD4 W 
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mutants since the C- terminal ends of these molecules remain 
unaltered. The bound gp!20 is then eluted with 2M MgCl 2 , 
concentrated by Amicon filtration, and dialyzed into 10 mM 
HEPES, pH 7.0, The purity of the proteins is determined by 
5 SDS-PAGE and silver staining. 

12 . Characterization of recombinant HIV-1 crpl20 proteins - 
The purified glycoproteins are subjected to extensive 
biochemical and immunologic characterization. The integrity 

10 of the proteins is monitored by SDS-PAGE and silver staining 
under reducing and non- reducing conditions. The 
glycoproteins are deglycosylated by treatment with the 
enzyme N-glycosidase F which cleaves N- linked oligo- 
saccharides, and are assayed by SDS-PAGE and silver staining 

15 to monitor molecular weight shifts. The purified 

glycoproteins are also tested for reactivity with several 
well characterized anti-gpl20 monoclonal antibodies that 
recognize both linear and discontinuous epitopes. The 
binding affinity to sCD4 is estimated using an ELISA assay. 

20 

The purified proteins HIV-1 gp!20 LAI , gpl20 LAJ -V3 < - > / gpl20 LA1 -V3 < * 
>-CD4 ( -\ gpl20jR.PL, gpl20 JR . FL -V3 H , and gpl20^. FL -V3 w -CD4 w # were 
tested for their ability to bind cell surface human CD4. 
DG44 #3 cells, a recombinant cell line designed to express 

25 human CD4 on the membrane surface, were grown in T. flasks 
and trypsinized. 5 X 10 5 cells/experiment were aliquoted 
into FACS buffer (PBS + 2% BSA and 0.1% NaN 3 ) , washed 
several times in the same buffer, and then incubated with 
100 ul of a solution of purified gpl20 protein at 5ug/ml in 

30 FACS buffer at 37°C for 2 hr. The cells were washed in FACS 
buffer, and then incubated in 100 ul solution containing 
5ug/ml sheep polyclonal IgG against the highly conserved C- 
terminus of gp!20 in FACS buffer at 37 °C for 2 hr. The 
cells were washed in FACS buffer then incubated in 100 ul 
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solution containing FITC- labeled rabbit anti-sheep IgG 
polyclonal antibody at 37°C for 2 hr. The cells were washed 
with FACS buffer and then resuspended in 500 ul FACS buffer. 
The cells were then analyzed on a Bee ton Dickinson FACScan 
5 according to the manufacturer's instructions. As a control 
for expression of CD4 on the DG44 #3 cells, FITC- labeled 
0KT4A (Becton Dickinson) was used. 

13 . A protocol for inoculation of animals with the mutant 
10 HIV-1 gpl20 envelope glycoproteins . 

Alum is used as an adjuvant during the inoculation series. 
The inoculum is prepared by dissolving the mutant HIV-1 
gp!20 envelope glycoprotein antigen in physiologic saline at 
a final antigen concentration of 100 ug/ml . Preformed alum 

15 (aluminum hydroxide gel) is added to the solution to a final 
level of 5 00 ug/ml aluminum. The antigen is allowed to 
adsorb onto the alum gel for two hours at room temperature. 
Following adsorption, the gel with the antigen is washed 
twice with physiologic saline and resuspended in the saline 

20 to a protein concentration of 100 ug/ml. 

Monkeys and/or Guinea Pigs are individually inoculated with 
four 100 ug doses of the mutant HIV-l gpl20 envelope 
glycoprotein antigen adsorbed onto alum. Each dose is 
25 injected intramuscularly. The doses are delivered one or 
five months apart (week 0, 4, 8 and 28). the animals are 
bled at intervals of two or four weeks. Serum samples are 
prepared from each bleed to assay for the development of 
specific antibodies as described in the subsequent sections. 

30 

14 . Analysis of sera for anti -mutant HIV-1 opl20 envelope 
glycoprotein IgG antibodies . 

Each serum sample is analyzed by ELISA. Polystyrene 
microtiter plates are coated with 0.5 ug per well of pure 
35 mutant HIV-1 gpl20 envelope glycoprotein in phosphate- 
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buffered physiological saline (PBS) at 4°C. Each well is 
then washed with PBS containing 0.5% TWEEN-20 (PBS-TW). 
Test serum, diluted serially in PBS-TW, is added to the 
mutant HIV-l gpl20 envelope glycoprotein- containing wells 
5 and allowed to react with the adsorbed mutant HIV-l .gpl20 
envelope glycoprotein for one hour at 37°C. The wells are 
then washed extensively in PBS-TW. Each well then receives 
0.1% p-nitrophenyl phosphate in 10% diethanolamine, pH 9.8, 
containing 0.5 mM MgCl 2 .6H 2 0. The ensuing reaction is 
10 allowed to proceed at room temperature for 30 minutes, at 
which time it is terminated by the addition of 3 . 0 N NaOH. 

The greater the interaction of antibodies in the test serum 
with the mutant HXV-1 gp!20 envelope glycoprotein, the 

15 greater is the amount of alkaline phosphatase bound onto the 
well. The phosphatase enzyme mediates the breakdown of p- 
nitrophenyl phosphate into a molecular substance which 
absorbs light at a wavelength of 405 nm. Hence, there 
exists a direct relationship between the absorbance at 4 05 

2 0 nm of light at the end of the ELISA reaction and the amount 
of mutant HIV-l gpl20 envelope glycoprotein- bound antibody. 
All animals inoculated with mutant HIV-l gpl20 envelope 
glycoprotein whose serum reacts specifically with the mutant 
HIV-l gpl20 envelope glycoprotein in the ELISA have a 

25 positive antibody response against mutant HIV-l gpi20 
envelope glycoprotein. 

15 . Analysis of sera for activity which specifically 
neutralizes HIV-l infectivity . 
30 Virus -neutralizing activity is determined with an assay 
based on the use of multiplicity curves in which the ratio 
of infectious virus surviving antibody treatment (V n ) is 
compared to infectious virus in uninhibited cultures (V 0 ) at 
various dilutions of antisera. The neutralization titer of 
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the sera is then interpolated as that sera dilution which 
yields one log reduction in infectious titer (i.e., V 0 /V o = 
0.1). Briefly, 4-fold dilutions of virus (laboratory- 
adapted and primary isolates) are prepared to yield 
5 infectious doses of 0.1 to 100 TCID 50 (Tissue Culture 
Infection Dose) in 20 ul . Serial 3 -fold dilutions of sera 
are also prepared and 20 ul of each serum dilution are 
incubated with each dilution of virus in duplicate for 60 
minutes at room temperature in a 96 -well microtiter plate. 

10 20 ul of AA5 cells (PHA stimulated PBMCs for primary HIV-l 
isolates) are then added to the serum/virus mixtures. Cells 
are cultured for 7 days by the addition of fresh medium 
every other day. On the seventh day, supernatant from each 
well is removed and tested for the presence of reverse 

15 transcriptase (RT) . Infection in each well is then scored 
as either positive or negative based on the RT counts, and 
the infectious dose of virus in each treatment group is 
calculated using the Reed and Muench (28) formula. The 
neutralization titers represent the reciprocal serum 

2 0 dilution required to reduced infectious dose of virus by one 
log. The above culture time is for the prototypic HIV-l^ 
isolate tested on the AA5 cell line. In the case of primary 
isolates, the termination date is usually 11-14 days. 
Culture conditions for PBMCs is not as demanding since 

25 doubling time is restricted. In the case of PBMCs, one day 
PHA stimulations are used at a final concentration of 1.5 X 
10 6 /ml on day 0. Half that number of fresh PBMCs are then 
added again on days 4 and 8. This multiple addition of 
PBMCs is meant to amplify virus output upon successful 

30 infection so that the readout RT signal is strong. Again, 
the final readout titer for the primary isolate/PBMC is the 
reciprocal serum dilution which reduces infectious titer by 
one log. 



5*4722477 PCT/US94/03282 

47 

16 • Passive hyper immune therapy . 

Non-HIV-l- infected humans are immunized with the mutant HIV- 
1 gpl20 envelope glycoprotein antigens according to a 
protocol similar to that described above in section 12. For 
5 passive hyperimmune therapy in HIV- l- infected individuals, 
blood plasma is taken from mutant HIV- 1 gpl20 envelope 
glycoprotein immunized, non- HIV- l- infected human donors 
whose plasma has high levels of neutralizing antibodies. 
The plasma is pooled from several donors, purified to remove 
10 nonimmunoglobulin proteins and is then sterilized to kill 
any other viruses or pathogens. The treated plasma is then 
injected into individuals infected with HIV-1, with repeated 
injections every week, every two weeks, or every month. 
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Results 

Eukaryotic expression vectors designed to express high 
levels of HIV-1^, gpl20 and HIV-1jr.pl gpl20 were constructed. 
5 The CMV MIE promoter/enhancer was used to drive*, the 
transcription of a gene fusion consisting of the human tPA 
signal sequence fused to mature gpl20 (Figures 2 and 7) . 
The complete sequence of the transcription unit from the 
Hinc II site of the CMV promoter/enhancer to the Not I site 

10 just 3' from the stop codon in gpl20 is shown in figure 3. 
This vector was used to transf ect COSM5 cells in a transient 
assay- The transf ected cells were labeled with 35 S- cysteine 
and the media immunoprecipitated with a CD4- immunoglobulin- 
Protein A-Sepharose complex. The precipitated products were 

15 analyzed using a reducing 10% SDS-PAGE gel and 
autoradiography (Figure 4) . A 120 kD band was detected when 
PPI4-tPA-gpl2 0 lJU was used to transf ect COS cells (lane 3) . 
A band migrating with a slightly lower molecular mass was 
detected when PPI4- tPA-gpl20 JR . FL was used to transf ect COS 

20 cells (lane 4) . No radiolabeled products were detected in 
the mock infected cells. Using a sheep polyclonal antibody 
directed against the highly conserved C- terminal end of HIV- 
1 gpl20 in an ELISA assay, the level of expression of HIV-l 
gpl20 was determined to be 2350 ng/ml. 

25 

The PPI4-tPA-gpl20 1 ^ I vector was then used to stably 
transf ect the dhfr' CHO cell line DXBll. Two days post- 
transf ection, the cells were plated at low density in 
nucleoside- free medium. Eight days post- transf ection, 
30 surviving clones were isolated and expanded. Individual 
primary transf ectants were tested for gpl20 expression using 
the ELISA method described in the methods section. Several 
primary CHO transf ectants expressed significant quantities 
(10-120 ng/ml) of gpl20 (Figure 5) . Three of the highest 
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expressing clones were then subjected to increasing 
concentrations of methotrexate in order to amplify, in 
tandem, the copy number of the dhfr and gpl20 genes. Cell 
lines were established that express high levels of gpi20 
5 with rates of secretion greater than 1 mg/liter. These were 
then used to purify gpl20 to homogeneity. 

Six CHO cell lines were established, using the procedures 

«q___.~_.: Vi.--.~3 ~ ^ v» -» ^ t~ *3 <-> ^. v- — . *- — _ \» - ^>v n -> -j _ 

10 of the following proteins: HIV-1 gp!20 LAI/ gpl20 LAI -V3 H / 
g P 120 LA1 -V3<">-CD4<-> / gpl20jR.PL, gpl20jR.PL- V3 W , and gp^O^- V3 ( ->- 
CD4 ( ' ) . Metabolic labeling of these cells with 35 S- cysteine 
followed by immunoprecipitation with the human monoclonal 
antibody F105 and analyzed by SDS-PAGE and autoradiography 

15 showed the presence of the gpl20 proteins in the culture 
supernatant (Figure 14) . From these cell lines the gpl20 
proteins were purified to homogeneity. Analysis by SDS-PAGE 
followed by silver- staining showed the purity of these 
proteins to be greater than 90% (Figure 15) . 

20 

It was shown by FACScan analysis that the two CD4 binding 
mutants HIV-igpi2 0 1AI -V3 ( - ) -CD4 ( - ) and HIV-l gpl20 JR . FL -V3 ( * ) -CD4 ( - ) had 
no appreciable binding to recombinant cell lines designed to 
express high levels of human CD4 on their membrane surface 
25 (Figure 16, panel 4 and data not shown, respectively) . 
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Discussion 

The advantage of using the mutant HIV-l gpl20 envelope 
5 glycoproteins as immunogens is that these proteins will not 
elicit an immune response against the V3 loop, a highly 
immunodominant epitope on gpl20. This is significant because 
the V3 loop may skew the humoral immune response away from 
discontinuous epitopes in the CD4-binding site* Mutant HIV-l 

10 gpl20 envelope glycoproteins having partial and total V3 
loop deletions have been made (30) . Deletion of the V3 loop 
therefore exposes the CD4 -binding site to the immune system, 
allowing the immune system to mount a response against this 
critical region (18) ♦ Another advantage of using the mutant 

15 HIV-l gp!20 envelope glycoprotein as an immunogen is that it 
has significantly reduced affinity for cell surface CD4 . An 
efficient humoral immune response depends on the binding of 
antigen to B cell surface immunoglobulin. The presence of 
the high-af f inity CD4 receptor on large numbers of cells in 

20 the body may significantly diminish the ability of native 
gpl20 to induce an effective humoral immune response. The 
rationale of mutating gpl2 0 at the CD4 binding site is to 
redirect the mutant HIV-l gpl2 0 envelope glycoprotein away 
from cell surface CD4 toward immunoglobulin -bearing B cells, 

25 thereby allowing the immune system to mount a response 
against, inter alia , the CD4 -binding site. 



WO 94/22477 PCT/US94/03282 

51 

References 

1. Klatzmann, D.R., et. al . (1990) Immunodeficiency 
Reviews 2, 43-66. 

5 

2. Lasky, L.A. , et. al . (1987) Cell 50, 975-985. 

3. Maddon, P.J., et. al. (1986) Cell 47, 333-348. 
10 4. Maddon, P.J., et . al. (1966) Cell 54, 865-674. 

5. Maddon, P.J., et . al. (1985) Cell 42, 93-104. 

6. Maddon, P.J., et. al. (19 87) Proc. Natl. Acad. Sci. 
15 U.S.A. 84, 9155-9159. 

7. Richardson, N.E., et. al. (1988) Proc. Natl. Acad. Sci. 
U.S.A. 85, 6102-6106. 

20 8. Chao, B.H., et . al. (1989) J. Biol. Chem. 264, 5812- 
5817. 

9. Arthos, J., et. al . (1989) Cell 57, 469-481. 
25 10. Wang, J., et. al. (1990) Nature 348, 411-418. 

11. Ryu, S.-E., et. al. (1990) Nature 348, 419-426. 

12. Leonard, C.K., et. al . (1990) J. Biol. Chem. 265, 
30 10373-10382. 

13. Earl, P.L., et. al . (1990) Proc. Natl. Acad. Sci. 
U.S.A. 87, 648-652. 

35 14. Helseth, E., et. al . (1991) J. Virol. 65, 2119-2123. 



WO 94/22477 PCT/US94/03282 

52 

15. Bolognesi, D.P. (1990) TIBTech 8, 40-45. 

16. Olshevsky, U. , et . al . (1990) J. Virol. 64, 5701-5707. 
5 17. Steimer, K.S., et. al. (1991) AIDS 5, S135-143. 

18. Wyatt, R., et. al. (1992) J. Virol. 66, 6997-7004. 

19. Zolla-Pazner, S., et. al . (1992) Sem. in Virology 3, 
10 203-211. 

20. Steimer, K.S., et. al . (1991) Science 254, 105-108. 
21- Pollard, S.R., et. al . (1992) EMBO J. 11, 585-591. 
22. Okayama, H. (1983) Mol. Cell. Biol. 3, 280-289. 



15 



20 



25 



23. Maniatis, T. , et. al . (1990) Molecular Cloning, Vol. 1- 
3. 

24. Thomsen, D.R., et . al . (1984) Proc. Natl. Acad. Sci. 
U.S.A. 81, 659-663. 

25. Pennica, D. , et. al . (1983) Nature 301, 214-221. 

26. Wain-Hobson, S., et. al. (1985) Cell 40, 9-17. 

27. Koyanagi, Y. (1987) Science 236, 819-822. 
30 28. Reed, L.J. (1938) Am. J. Hyg., 27, 493-497. 

29. Cohn, E.J. et al., (1944) J. Clin. Invest. 23, 417-432. 

30. Shiow-Her, C. , et al. (1992) J. of Cellular Biochem. , 
35 Supplement 16E, Abstrtact Q105. 



WO 94/22477 



PCT/US94/03282 



53 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Progenies Pharmaceuticals, Inc. 

Cii) TITLE OF INVENTION: HlV-t VACCINES, ANTIBODY COMPOSITIONS RELATED 

THERETO, AND THERAPEUTIC AND PROPHYLACTIC USES 
THEREOF 

(iii) NUMBER OF SEQUENCES: 29 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooper & Dunham 

(B) STREET: 30 Rockefeller Plaza 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY : USA 

(F) ZIP: 10112 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
CO OPERATING SYSTEM: PC -DOS/MS-DOS 
CD) SOFTWARE: Pa tent In Release #1.24 

<vi> CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
<B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/037,816 

(B) FILING OATE: 26-MAR-1993 

(vi ii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: White, John P. 

(B) REGISTRATION NUMBER: 28,678 

CO REFERENCE /DOCKET NUMBER: 41190-A-PCT/JPW/AJM 

(ix) TELECOMMUNICATION INFORMATION: 
CA) TELEPHONE: C212) 977-9550 
(B) TELEFAX: (212) 664-0525 
CO TELEX: 422523 COOPUI 



C2> INFORMATION FOR SEQ ID NO:1: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 45 amino acids 

CB) TYPE: amino acid 
CO STRAND E0NESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

Xaa Xaa Xaa Cys Xaa lie Xaa Xaa Xaa Xaa Xaa Xaa Trp xaa Xaa Xaa 



1 



5 



10 



15 



Xaa 



Xaa Ala Xaa Tyr Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 



Ser 



Xaa Xaa Thr Gly Xaa Xaa Xaa Xaa Arg Xaa Gly Xaa 
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35 40 45 

C2) INFORMATION FOR SEO ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi ) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Thr Leu Pro Cys Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val 
1 5 10 15 

Gly Lys Ala Met Tyr Ala Pro Pro He Ser Gly Gin He Arg Cys Ser 
20 25 30 

Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly 
35 40 45 

(2) INFORMATION FOR SEQ ID HO: 3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid - 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Thr Leu Pro Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Glu Val 
15 10 15 

Gly Lys Ala Met Tyr Ala Pro Pro He Arg Gly Gin lie Arg Cys Ser 
20 25 30 

Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly 
35 40 45 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 bate pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genoaric) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
GATCCTGCAG TCACCGTCCT TGACACGATG GATGCAATGA AGAGA 45 
(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: DNA (genomic) 



<xi) SEQUENCE OESCRTPTIOM: SEO ID NO:S: 



AAGTCTTCTC CTCGGTCTTG TCTTTTTAAC ACCCAG 



36 



(2) INFORMATION FOR SEO ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ONA (genomic) 



Cxi) SEQUENCE DESCRIPTION? SEQ ID NO:6: 

TTCAGAA6AG GAGCCAGAAC AGAAAAATTG TGGGTC 36 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGAAAAAAGC GGCCGCTCAT TTTTCTCTCT GCACCACTC 39 
(2) INFORMATION FOR SEO ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
GATCGGCGCC AGAGTAGAAA A6TTGTGGGT CAC 33 
(2) INFORMATION FOR SEO ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 




WO 94/22477 



PCT/US94/03282 



56 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:9: 



CTG7AGAAAT TAATTGTACA GGTGCTGGAC ATTGTAACAT TAGTAGAGC 



49 



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CTCGAGCATG CATTCGAAGC TCGCTGATC 29 
(2) INFORMATION FOR SEO ID NO: 11: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEO ID NO:11: 

CAATTTATAA ACATGGTGCA GGAAGTAGG 29 

(2) INFORMATION FOR SEO ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 29 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAAATTATAA ACATGGTGCA GGAAGTAGG 29 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3125 base pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 



(A) NAME /KEY: CDS 

(B) LOCATION: 1555. .3115 
(D) OTHER INFORMATION: 
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Cxi) SEQUENCE DESCRIPTION: SEO ID NO: 13: 



TTGACATTGA 


TTATTGACTA 


GTTATTAATA 


GTAATCAATT 


ACGGGGTCAT 


TAGTTCATAG 


60 


CCCATATATG 


GAGTTCCGCG 


TTACATAACT 


TACGGTAAAT 


GGCCCGCCTG 


GCTGACCGCC 


120 


CAACGACCCC 


CGCCCATTGA 


CGTCAATAAT 


GACGTATGTT 


CCCATAGTAA 


CGCCAATAGG 


180 


GACTTTCCAT 


TGACGTCAAT 


GGGTGGACTA 


TTTACGGTAA 


ACTGCCCACT 


TGGCAGTACA 


240 


TCAAGTGTAT 


CATATGCCAA 


GTACGCCCCC 


TATTGACGTC 


AATGACGGTA 


AATGGCCCGC 


300 


CTGGCATTAT 


GCCCAGTACA 


TGACCTTATC 


GGACTTTCCT 


ACTTGGCAGT 


ACATCTACGT 


360 


ATTAGTCATC 


GCTATTACCA 


TGGTGATGCG 


GTTTTGGCAG 


TACATCAATG 


GGCGTGGATA 


420 


GCGGTTTGAC 


TCACGGGGAT 


TTCCAAGTCT 


CCACCCCATT 


GACGTCAATG 


GGAGTTTGTT 


460 


TTGGCACCAA 


AATCAACGGG 


ACTTTCCAAA 


ATGTCGTAAC 


AACTCCGCCC 


CATTGACGCA 


S40 


AATGGGCGGT 


AGGCGTGTAC 


GGTGGGAGGT 


CTATATAAGC 


AGAGCTCGTT 


TAGTGAACCG 


600 


TCAGATCGCC 


TGGAGACGCC 


ATCCACGCTG 


TTTTGACCTC 


CATAGAAGAC 


ACCGGGACCG 


660 


ATCCAGCCTC 


CGCGGCCGGG 


AACGGTGCAT 


TGGAACGCGG 


ATTCCCCGTG 


CCAAGAGTGA 


720 


CGTAAGTACC 


GCCTATAGAC 


TCTATAGCCA 


CACCCCTTTG 


GCTCTTATGC 


ATGCTATACT 


780 


GTTTTTGGCT 


TGGGCCAACA 


CCCCGTCCTA 


GATAGGTGAT 


GGTATAGCTT 


AGCCTATAGG 


840 


TGTGGCTTAT 


TGACCATTAT 


TGACCACTCC 


CCTATTGGTG 


ACGATACTTT 


CCATTACTAA 


900 


TCCATAACAT 


GGCCGCTCTT 


TGCCACAACT 


ATCTCTATTG 


GCTATATGCC 


AATACTCTGT 


960 


CCTTCAGAGA 


CTGACACGGA 


CTCTGTATTT 


TTACAGGATG 


GGGTCCCATT 


TATTATTTAC 


1020 


AAATTCACAT 


ATACAACAAC 


GCCGTCCCCC 


GTGCCCGCAG 


TTTTTATTAA 


CATGCGGGAT 


1080 


CTCCACGCGA 


ATCTCGGGTA 


CGTGTTCCGG 


ACATGGGCTC 


TTCTCCGGTA 


GCGGCGGAGC 


1140 


TCCACATCCG 


AGCCTGTCCC 


ATGCCCATGC 


CTCCAGCGGC 


TCATGGTCGC 


TCGGCAGCTC 


1200 


CTTGCTCCTA 


ACAGTGGAGG 


CCAGACTTAG 


GCACAGGACA 


ATGCCCACCA 


CCACCAGTGT 


1260 


GCCGCACAAG 


GCCGTGGCGG 


TAGGGTATGT 


GTCTGAAAAT 


GAGCTCGGAG 


ATTCGGCTCG 


1320 


CACCGCTGAC 


GCAGATGGAA 


GACTTAAGGC 


AGCGGCAGAA 


GAAGATGCAG 


GCAGCTGAGT 


1380 


TGTTGTATTC 


TGTAGAGTTG 


GAGGTAACTC 


CCGTTGCGGT 


GCTGTTAACG 


GTGGAGGCCA 


1440 


GTGTAGTCTG 


AGCAGTACTC 


GTTGCTGCCG 


CGCGCGCCAC 


CAGACATAAT 


AGCTGACAGA 


1500 


CTAACAGACT 


GTTCCTTTCC 


ATGGGTCTTT 


TCTGCAGTCA 


CCGTCCTTGA 


CACG ATG 


1557 



Net 
1 



GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG CTG CTG CTG TGT GGA GCA 1605 

Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly Ala 

5 10 15 

GTC TTC CTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA GGC 1653 

Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg Gly 

20 25 30 

GCC AGA ACA GAA AAA TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT GTG 1701 

Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val 

35 40 45 

TGG AAG GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA GCA 1749 
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Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala 
50 55 60 65 

TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC 1797 
Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys val Pro 
70 75 80 

ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GTA AAT GTG ACA GAA AAT 1845 
Thr Asp Pro Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn 
85 90 95 

TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG CAT GAG GAT ATA 1893 
Phe Asn Met Trp Lys Asn Asp Met Val Glu Gin Met His Glu Asp He 
100 105 110 

ATC ACT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC CCA 1941 
lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro 
115 120 125 

CTC TGT GTT AGT TTA AAG TGC ACT GAT TTG GGG AAT GCT ACT AAT ACC 1989 
Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn Thr 
130 135 140 H5 

AAT AGT AGT AAT ACC AAT AGT AGT AGC GGG GAA ATG ATG ATG GAG AAA 2037 
Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu Lys 
150 155 160 

GGA GAG ATA AAA AAC TGC TCT TTC AAT ATC AGC ACA AGC ATA AGA GGT 2085 
Gly Glu lie Lys Asn Cys Ser Phe Asn lie Ser Thr Ser He Arg Gly 
165 170 175 

AAG GTG CAG AAA GAA TAT GCA TTT TTT TAT AAA CTT GAT ATA ATA CCA 2133 
Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp He lie Pro 
180 1 85 190 

ATA GAT AAT GAT ACT ACC AGC TAT ACG TTG ACA AGT TGT AAC ACC TCA 2181 
lie Asp Asn Asp Thr Thr ser Tyr Thr Leu Thr Ser Cys Asn Thr Ser 
195 200 205 

GTC ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC ATA 2229 
Val He Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He Pro He 
210 21S 220 225 

CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAA TGT AAT AAT AAG 2277 
His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asn Lys 
230 235 240 

ACG TTC AAT GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA TGT 2325 
Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin Cys 
245 250 255 

ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTG AAT GGC 2373 
Thr His Gly lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly 
260 265 270 

AGT CTA GCA GAA GAA GAG GTA GTA ATT AGA TCT GCC AAT TTC ACA GAC 2421 
Ser Leu Ala Glu Glu Glu Val Val lie Arg Ser Ala Asn Phe Thr Asp 
275 280 285 

AAT GCT AAA ACC ATA ATA GTA CAG CTG AAC CAA TCT GTA GAA ATT AAT 2469 
Asn Ala Lys Thr lie He Val Gin Leu Asn Gin Ser Val Glu He Asn 
290 295 300 305 

TGT ACA AGA CCC AAC AAC AAT ACA AGA AAA AGT ATC CGT ATC CAG AGG 2517 
Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gin Arg 
310 315 320 

GGA CCA GGG AGA GCA TTT GTT ACA ATA GGA AAA ATA GGA AAT ATG AGA 2565 
Gly Pro Gly Arg Ala Phe Val Thr He Gly Lys He Gly Asn Met Arg 
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325 330 335 

CAA GCA CAT TGT AAC ATT AGT AGA GCA AAA TGG AAT GCC ACT TTA AAA 2613 
Gin Ala His Cys Asn lie Ser Arg Ala Lys Trp Asn Ala Thr Leu Lys 
340 345 350 

CAG ATA GCT AGC AAA TTA AGA GAA CAA TTT GGA AAT AAT AAA ACA ATA 2661 
Gin lie Ala Ser lys Leu Arg Gl-u Gin Phe Gly Asn Asn -Lys Thr lie 
355 360 365 

ATC TTT AAG CAA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG CAC AGT 2709 
lie Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He Val Thr His Ser 
370 375 380 385 

TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT 2757 
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe 
390 395 400 

AAT AGT ACT TGG TTT AAT AGT ACT TGG AGT ACT GAA GGG TCA AAT AAC 2B05 
Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn Asn 
405 410 415 

ACT GAA GGA AGT GAC ACA ATC ACA CTC CCA T6C AGA ATA AAA CAA TTT 2853 
Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys Arg lie Lys Gin Phe 
420 425 430 

ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG TAT GCC CCT CCC ATC 2901 
He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro lie 
435 440 445 

AGC GGA CAA ATT AGA TGT TCA TCA AAT ATT ACA GGG CTG CTA TTA ACA 2949 
Ser Gly Gin lie Arg Cys Ser Ser Asn lie Thr Gly Leu Leu Leu Thr 
450 455 460 465 

AGA GAT GGT GGT AAT AAC AAC AAT GGG TCC GAG ATC TTC AGA CCT GGA 2997 
Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He Phe Arg Pro Gly 
470 475 480 

GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT AAA 3045 
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys 
485 490 495 

GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA AAG AGA 3093 
Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg 
500 505 510 

AGA GTG GTG CAG AGA GAA AAA T GAGCGGCCGC 3125 
Arg Val Val Gin Arg Glu Lys 
515 520 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

Met ASv Ate Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe Arg Arg 
20 25 30 

Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
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35 



40 



Val Ttd Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp P*-o Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu 
85 90 95 

Asn Phe Asn Met Trp Lys Asn Asp Met Val Glu Gin Met His Glu Asp 
100 105 110 

lie He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Net Met Glu 
145 150 155 160 

Lys Gly Glu lie Lys Asn Cys Ser Phe Asn He Ser Thr Ser He Arg 
165 170 175 

Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp He lie 
180 185 190 

Pro He Asp Asn Asp Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr 
195 200 205 

Ser Val He Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He Pro 
210 215 220 

lie His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asn 
225 230 235 240 

Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin 
245 250 255 

Cys Thr His Gty lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
260 265 270 

Gly Ser Leu Ala Glu Glu Glu Val Val lie Arg Ser Ala Asn Phe Thr 
275 280 285 

Asp Asn Ala Lys Thr He He Val Gin Leu Asn Gin Ser Val Glu lie 
290 295 300 

Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser I le Arg He Gin 
305 310 315 320 

Arg Gly Pro Gly Arg Ala Phe Val Thr He Gly Lys He Gly Asn Net 
325 330 335 

Arg Gin Ala His Cys Asn He Ser Arg Ala Lys Trp Asn Ala Thr Leu 
340 345 350 

Lys Gin He Ala Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr 
355 360 365 

He He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He Val Thr His 
370 375 380 

Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu 
385 390 395 400 

Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn 
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405 ^10 

Asn Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys Arg lie Lys Gin 
420 425 ■ 430 

Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
435 4*0 445 

lie Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu 
450 455 460 

Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He Phe Arg Pro 
465 470 475 480 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
485 490 495 

Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys 
500 505 510 

Arg Arg Val Val Gin Arg Glu Lys 
515 520 

(2) IN FORMAT ION F05 SEC ID NO: 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix> FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1522 
(D) OTHER INFORMATION: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG CTG CTG CTG TGT GGA 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

GCA GTC TTC GTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 96 
Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

G6C GGC AGA GTA GAA AAG TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT 144 
Gly Gly Arg Val Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT* GCT AAA 192 
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 240 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA CTA TTG GAA AAT GTA ACA GAA 288 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
85 90 95 

CAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAG GAG GAT 336 
His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 
100 105 HO 
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ATA ATC AGT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC 384 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT ACT TTA AAT TGC AAG GAT GTG AAT GCT ACT AAT ACC 432 
Pro Leu Cys Vat Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 HO 

ACT AAT GAT AGC GAG GGA ACG ATG GAG AGA GGA GAA ATA AAA AAC TGC 480 
7hr Asn Asp Ser Glu Gly Thr Met Giu Arg Gly Glu lie Lys Asn Cys 
H5 ISO 155 160 

TCT TTC AAT ATC ACC ACA AGC ATA AGA GAT GAG GTG CAG AAA GAA TAT 528 
Ser Phe Asn lie Thr Thr Ser lie Arg Asp Glu Val Gin Lys Glu Tyr 
165 170 175 

GCT CTT TTT TAT AAA CTT GAT GTA GTA CCA ATA GAT AAT AAT AAT ACC 576 
Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

AGC TAT AGG TTG ATA AGT TGT GAC ACC TCA GTC ATT ACA CAG GCC TGT 624 
Ser Tyr Arg Leu He Ser Cys Asp Thr Ser Val lie Thr Gin Ala Cys 
195 200 205 

CCA AAG ATA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC CCG SCT 672 
Pro Lys lie Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala 
210 215 220 

GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG ACG TTC AAT GGA AAA GGA 720 
Gly Phe Ala He Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

CCA TGT AAA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 768 
Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro 
245 250 255 

GTA GTA TCA ACT CAA CTG CTG CTA AAT GGC AGT CTA GCA GAA GAA GAG 816 
Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

GTA GTA ATT AGA TCT GAC AAT TTC ACG AAC AAT GCT AAA ACC ATA ATA 864 
Val Val tie Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He He 
275 280 285 

GTA CAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ACA AGA CCC AAC AAC 912 
Val Gin Leu Lys Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn 
290 295 300 

AAT ACA AGA AAA AGT ATA CAT ATA GGA CCA GGG AGA GCA TTT TAT ACT 960 
Asn Thr Arg Lys Ser He His lie Gly Pro Gly Arg Ala Phe Tyr Thr 
305 310 315 320 

ACA GGA GAA ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT AAC ATT AGT 1008 
Thr Gly Glu He He Gly Asp He Arg Gin Ala His Cys Asn He Ser 
325 330 335 

AGA GCA AAA TGG AAT GAC ACT TTA AAA CAG ATA GTT ATA AAA TTA AGA 1056 
Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin He Val He Lys Leu Arg 
340 345 350 

GAA CAA TTT GAG AAT AAA ACA ATA GTC TTT AAT CAC TCC TCA GGA GGG 1104 
Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His Ser Ser Gly Gly 
355 360 365 

GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT TGT GGA GGA GAA TTT TTC 1152 
Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
370 375 380 

TAC TGT AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AAT AAT ACT 1200 
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Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr 
385 390 395 400 

GAA GGG TCA AAT AAC ACT GAA GGA AAT ACT ATC ACA CTC CCA TGC AGA 1248 

Giu Gty Ser Asn Asn Thr Glu Gly Asn Thr lie Thr Leu Pro Cys Arg 
405 410 415 

ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG TAT ;296 

He Lys Gin lie lie Asn Met Trp Gin Glu Val Gly tys Ala Met Tyr 
420 425 430 

GCC CCT CCC ATC AGA GGA CAA ATT AGA TGT TCA TCA AAT ATT ACA GGG 1344 

Ala Pro Pro He Arg Gly Gin lie Arg Cys Ser Ser Asn lie Thr Gly 
435 440 445 

CTG CTA TTA ACA AGA GAT GGT GGT ATT AAT GAG AAT GGG ACC GAG ATC 1392 

Leu Leu Leu Thr Arg Asp Gly Gly lie Asn Glu Asn Gly Thr Glu lie 

450 455 460 

TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA 1440 

Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
465 470 475 480 

TAT AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1488 

Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr 
485 490 495 

AAG GCA AAG AGA AGA GTG GTG CAA AGA GAA AAA T GAGCGGCCGC 1532 
Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys 
500 505 



(2) INFORMATION FOR SEO ID MO:16: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTION: SEO ID N0:16: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

Gly Gly Arg Val Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

Val trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
85 90 95 

His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 

ioo 105 no 

He lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 140 
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Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly Glu He Lys Asn Cys 
U5 150 155 160 

Ser Phe Asn He Thr Thr Ser He Arg Asp Glu Vat GLn Lys Glu Tyr 
165 170 175 

Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

Ser Tyr Arg Leu He Ser Cys Asp Thr Ser Val He Thr Gin Ala Cys 
195 200 205 

Pro Lys He Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala 
210 215 220 

Gly Phe Ala He Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 
245 250 255 

Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

Val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He He 
275 280 285 

Val Gin Leu Lys Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn 
290 295 300 

Asn Thr Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr 
305 310 315 320 

Thr Gly Glu lie lit Gly Asp lie Arg Gin Ala His Cys Asn He Ser 
325 330 335 

Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin He Val lie Lys Leu Arg 
340 345 350 

Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His Ser Ser Gly Gly 
355 360 365 

Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
370 375 380 

Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr 
385 390 395 400 

Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr He Thr Leu Pro Cys Arg 
405 410 415 

He Lys Gin He lie Asn Net Trp Gin Glu Val Gly Lys Ala Met Tyr 
420 425 430 

Ala Pro Pro He Arg Gly Gin He Arg Cys Ser Ser Asn He Thr Gly 
435 440 445 

Leu Leu Leu Thr Arg Asp Gly Gly He Asn' Glu Asn Gly Thr Glu lie 
450 455 460 

Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
465 470 475 480 

Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr 
485 490 495 



Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys 
500 505 
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(2) INFORMATION FOR SEQ 10 NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 base pairs 
(8) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ONA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : COS 

(B) LOCATION: 1..1474 

(D) OTHER INFORMATION: 

<xi) SEQUENCE DESCRIPTION: SEO 10 NO:17: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG CTG CTG CTG TGT GGA 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

GCA GTC TTC GTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 96 
Ala Val Phe Val Ser Pre Ser Gin Gluile His Ala Arg Phe Arg Arg 
20 25 30 

GGC GCC AGA ACA GAA AAA TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT 144 
Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

GTG TGG AAG GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GOT AAA 192 
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 240 
Ala Tyr Asp Thr Glu Val His Asn Vat Trp Ala Thr His Ala Cys Val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GTA AAT GTG ACA GAA 288 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu 
85 90 95 

AAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG CAT GAG GAT 336 
Asn Phe Asn Met Trp Lys Asn Asp Met val Glu Gin Met His Glu Asp 
100 105 110 

ATA ATC AGT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC 384 
tie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT AGT TTA AAG TGC ACT GAT TTG GGG AAT GCT ACT AAT 432 
Pro Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

ACC AAT AGT AGT AAT ACC AAT AGT AGT AGC GGG GAA ATG ATG ATG GAG 480 
Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu 
145 150 155 160 

AAA GGA GAG ATA AAA AAC TGC TCT TTC AAT ATC AGC ACA AGC ATA AGA 528 
Lys Gly Glu lie Lys Asn Cys Ser Phe Asn He Ser Thr Ser lie Arg 
165 170 175 

GGT AAG GTG CAG AAA GAA TAT GCA TTT TTT TAT AAA CTT GAT ATA ATA 576 
Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp lie II* 
180 185 190 

CCA ATA GAT AAT GAT ACT ACC AGC TAT ACG TTG ACA AGT TGT AAC ACC 624 
Pro He Asp Asn Asp Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr 
195 200 205 
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TCA GTC ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC 672 
Ser Val He Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro lie Pro 
210 215 220 

ATA CAT TAT TGT GCC CCG GCT GCT TTT GCG ATT CTA AAA TGT AAT AAT 720 
He His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asn 
225 230 235 240 

AAG ACG TTC AAT GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA 768 
Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin 
245 250 255 

TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTG AAT 816 
Cys Thr His Gly lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
260 265 270 

GGC AGT CTA GCA GAA CAA GAG GTA GTA ATT AGA TCT GCC AAT TTC ACA 864 
Gly Ser Leu Ala Glu Glu Glu Val Val lie Arg Ser Ala Asn Phe Thr 
275 280 285 

GAC AAT GCT AAA ACC ATA ATA GTA CAG CTG AAC CAA TCT GTA GAA ATT 912 
Asp Asn Ala Lys Thr lie lie Val Gin Leu Asn Gin Ser Val Glu lie 
290 295 300 

AAT TGT ACA GGT GCT GGA CAT TGT AAC ATT AGT AGA GCA AAA T6G AAT 960 
Asn Cys Thr Gly Ala Gly His Cys Asn lie Ser Arg Ala Lys Trp Asn 
305 310 315 320 

GCC ACT TTA AAA CAG ATA GCT AGC AAA TTA AGA GAA CAA TTT GGA AAT 1008 
Ala Thr Leu Lys Gin lie Ala Ser Lys Leu Arg Glu Gin Phe Gly Asn 
325 330 335 

AAT AAA ACA ATA ATC TTT AAG CAA TCC TCA GGA GGG GAC CCA GAA ATT 1056 
Asn Lys Thr lie lie Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He 
340 345 350 

GTA ACG CAC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT TCA 1104 
Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser 
355 360 365 

ACA CAA CTG TTT AAT AGT ACT TGG TTT AAT AGT ACT T6G AGT ACT GAA 1152 
Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu 
370 375 380 

GGG TCA AAT AAC ACT GAA GGA AGT GAC ACA ATC ACA CTC CCA TGC AGA 1200 
Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys Arg 
385 390 395 400 

ATA AAA CAA TTT ATA AAC ATG TGG CAG GAA GTA GGA AAA GCA ATG TAT 1248 
lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr 
405 410 415 

GCC CCT CCC ATC AGC GGA CAA ATT AGA TGT TCA TCA AAT ATT ACA GGG 1296 
Ala Pro Pro lie Ser Gly Gin lie Arg Cys Ser Ser Asn lie Thr Gly 
420 425 430 

CTG CTA TTA ACA AGA GAT GGT GGT AAT AAC AAC AAT GGG TCC GAG ATC 1344 
Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu lie 
435 440 445 

TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA 1392 
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
450 455 460 

TAT AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1440 
Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr 
465 470 475 480 

AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA T GAGCGGCCGC 1464 
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Ala Lys Arg Arg Val Val Cln Arg Glu Lys 
465 490 



(2) INFORMATION FOR SEQ ID N0:1S: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

<H) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu 
85 90 95 

Asn Phe Asn Met Trp Lys Asn Asp Met Val Glu Gin Met His Glu Asp 
100 105 110 

lie He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu 
145 150 155 160 

Lys Gly Glu He Lys Asn Cys Ser Phe Asn lie Ser Thr Ser lie Arg 
165 170 175 

Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp lie lie 
180 185 190 

Pro He Asp Asn Asp Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr 
195 200 205 

Ser Val lie Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro lie Pro 
210 215 220 

lie His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asn 
225 230 235 240 

Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin 
245 250 255 

Cys Thr His Gly lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
260 265 270 

Gly Ser Leu Ala Glu Glu Glu Val Val He Arg Ser Ala Asn Phe Thr 
275 280 285 
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Asp Asn ALa Lys Thr He He Vat Gin Leu Asn Gin Ser val Glu He 
290 295 300 

Asn Cys Thr Gly Ala Gly His Cys Asn He Ser Arg Ala Lys Trp Asn 
305 310 315 320 

Ala Thr Leu Lys Gin He Ala Ser Lys Leu Arg Glu Gin Phe Glv Asn 
325 330 335 

Asn Lys Thr He He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He 
340 345 350 

Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser 
355 360 365 

Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu 
370 375 380 

Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu Pro Cys Arg 
385 390 395 400 

He Lys Gin Phe He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr 
405 410 415 

Ala Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly 
420 425 430 

Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He 
435 440 445 

Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
450 455 460 

Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr 
465 470 475 480 

Lys ALa Lys Arg Arg Val Val Gin Arg Glu Lys 
485 490 

(2) INFORMATION FOR SEO ID N0:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1.-1439 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 19: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT STG CTG CTG CTG TGT GGA *8 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

GCA GTC TTC GTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 96 
Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 



20 



25 



30 * 



GGC 
Gly 



GGC AGA GTA 
Gly Arg Val 
35 



GAA AAG 

Glu Lys 



TTG TGG 
Leu Trp 
40 



GTC 
Val 



ACA GTC TAT TAT 
Thr Val Tyr Tyr 

45 



GGG GTA CCT 
Gly Val Pro 
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GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA 192 
Vat Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
SO 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 240 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA 288 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
85 90 95 

CAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAG GAG GAT 336 
His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 
100 105 110 

ATA ATC AGT TTA TGG GAT CAA ACC CTA AAG CCA TGT GTA AAA TTA ACC 384 
He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT ACT TTA AAT TGC AAG GAT GTG AAT GCT ACT AAT ACC 432 
Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 140 

ACT AAT GAT AGC GAG GGA ACG ATG GAG AGA GGA GAA ATA AAA AAC TGC 480 
Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly Glu He Lys Asn Cys 
145 150 155 160 

TCT TTC AAT ATC ACC ACA AGC ATA AGA GAT GAG GTG CAG AAA GAA TAT 528 
Ser Phe Asn He Thr Thr Ser He Arg Asp Glu Val Gin Lys Glu Tyr 
165 170 175 

GCT CTT TTT TAT AAA CTT GAT GTA GTA CCA ATA GAT AAT AAT AAT ACC 576 
Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

AGC TAT AGG TTG ATA AGT TGT GAC ACC TCA GTC ATT ACA CAG GCC TGT 624 
Ser Tyr Arg Leu lie Ser Cys Asp Thr Ser Val He Thr Gin Ala Cys 
195 200 205 

CCA AAG ATA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC CCG GCT 672 
Pro Lys He Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala 
210 215 220 

GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG ACG TTC AAT GGA AAA GGA 720 
Gly Phe Ala He Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

CCA TGT AAA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 768 
Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 
245 250 255 

GTA GTA TCA ACT CAA CTG CTG CTA AAT GGC AGT CTA GCA GAA GAA GAG 816 
val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

GTA GTA ATT AGA TCT GAC AAT TTC ACG AAC AAT GCT AAA ACC ATA ATA 864 
Val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He He 
275 280 285 

GTA CAG CTC AAA GAA TCT GTA GAA ATT AAT TGT ACA GGT GCT GGA CAT 912 
Val Gin Leu Lys Glu Ser Val Glu He Asn Cys Thr Gly Ala Gly His 
290 295 300 

TCT AAC ATT AGT AGA GCA AAA TGG AAT GAC ACT TTA AAA CAG ATA GTT 960 
Cys Asn He Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin He Vat 
305 310 315 320 

ATA AAA TTA AGA GAA CAA TTT GAG AAT AAA ACA ATA GTC TTT AAT CAC 1008 
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He Lys Leu Arg Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His 
325 330 335 

TCC TCA GGA GGG GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT TGT GGA 1056 
5er Ser Gly Gly Asp Pro Glu lie Val Met His Ser Phe Asn Cys Gly 
340 345 350 

GGA GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG M04 
Gly Giu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp 
355 360 365 

AAT AAT AAT ACT GAA GGG TCA AAT AAC ACT GAA GGA AAT ACT ATC ACA 1152 
Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr He Thr 
370 375 380 

CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC ATG TGG CAG GAA GTA GGA 1200 
Leu Pro Cys Arg lie Lys Gin lie He Asn Met Trp Gin Glu Val Gly 
385 390 395 400 

AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT AGA TGT TCA TCA 1248 
Lys Ala Met Tyr Ala Pro Pro lie Arg Gly Gin He Arg Cys Ser Ser 
405 410 415 

AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT GGT ATT AAT 'GAG AAT 1296 
Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly He Asn Glu Asn 
420 425 430 

GGG ACC GAG ATC TTC ACA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1344 
Gly Thr Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp 
435 440 . 445 

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA 1392 
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly 
450 455 460 

GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAA AGA GAA AAA TG 1439 
Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys 
465 470 475 



AGCGGCCGC 



(2) INFORMATION FOR SEO ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: protein 

Cxi) SEOUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe Arg Arg 
20 25 30 

Gly Gly Arg Val Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 



1448 



SUBSTITUTE SHEET (RULE 26) 



WO 94/22477 



PCT/US94/03282 



71 



85 



90 



95 



His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 
100 105 110 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 140 

Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly Glu lie Lys Asn Cys 
145 150 155 160 

Ser Phe Asn lie Thr Thr Ser lie Arg Asp Glu Val Gin Lys Glu Tyr 
165 170 175 

Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

Ser Tyr Arg Leu lie Ser Cys Asp Thr Ser Val lie Thr Gin Ala Cys 
195 200 205 

Pro Lys lie Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala 
210 215 220 

Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 
245 250 255 

Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

Val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He lie 
275 280 285 

Val Gin Leu Lys Glu Ser Val Glu lie Asn Cys Thr Gly Ala Gty His 
290 295 300 

Cys Asn lie Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin He Val 
305 310 315 320 

He Lys Leu Arg Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His 
325 330 335 

Ser Ser Gly Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly 
340 345 350 

Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp 
355 360 365 

Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr He Thr 
370 375 380 

Leu Pro Cys Arg lie Lys Gin He He Asn Met Trp Gin Glu Val Gly 
385 390 395 400 

Lys Ala Met Tyr Ala Pro Pro lie Arg Gly Gin lie Arg Cys Ser ser 
405 410 415 

Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly He Asn Glu Asn 
420 425 430 

Gly Thr Glu lie Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp 
435 440 445 

Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly 
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450 455 460 

val Ala Pro Thr Lys Ala Lys Arg Arg Val Vat Gin Arg Glu Lys 
465 470 475 

(2) INFORMATION FOR SEO ID NO:21: 

(i) SEQUENCE CHARACTER i ST i CS: 

(A) LENGTH: 1484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA (genomic) 

Cix) FEATURE: 

CA) NAME/KEY: CDS 

CB) LOCATION: 1..1454 
CD) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEO ID HO:21: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG C7G CTG CTG 7GT GGA 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 TO 15 

GCA GTC TTC GTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 96 
Ala Val Phe Val Ser Pro Ser Gin GLu He His Ala Arg Phe Arg Arg 
20 25 30 

GGC GCC AGA ACA GAA AAA TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT 144 
Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

GTG TGG AAG GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA 192 
val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 240 
Ala Tyr Asp Thr Glu val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GTA AAT GTG ACA GAA 288 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu 
85 90 95 

AAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG CAT GAG GAT 336 
Asn Phe Asn Met Trp Lys Asn Asp Met Val Glu Gin Met His Glu Asp 
100 10S 110 

ATA ATC AGT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC 384 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT AGT TTA AAG TGC ACT GAT TTG GGG AAT GCT ACT AAT 432 
Pro Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

ACC AAT AGT AGT AAT ACC AAT AGT AGT AGC GGG GAA ATG ATG ATG GAG 480 
Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu 
145 150 155 160 

AAA GGA GAG ATA AAA AAC TGC TCT TTC AAT ATC AGC ACA AGC ATA AGA 528 
Lys Gly Glu He Lys Asn Cys Ser Phe Asn lie Ser Thr Ser lie Arg 
165 170 175 

GGT AAG GTG CAG AAA GAA TAT GCA TTT TTT TAT AAA CTT GAT ATA ATA 576 
Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp lie lie 



SUBSTITUTE SHEET (RULE 26) 



WO 94/22477 



PCT/US94/03282 



73 

180 185 190 

CCA ATA GAT AAT GAT ACT ACC AGC TAT ACG TTG ACA AGT TGT AAC ACC 624 
Pro lie Asp Asn Asp Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr 
195 200 205 

TCA GTC ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC 672 
Ser Val He Thr Gtn Ala Cys Pro Lys Vat Ser Phe Glu Pro He Pro 
210 215 220 

ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAA TGT AAT AAT 720 
lie His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asn 
225 230 235 240 

AAG ACG TTC AAT GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA 768 
■Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gtn 
245 250 255 

TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTG AAT 816 
Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
260 265 270 

CGC-ACT CTA GCA GAA CAA GAG GTA GTA ATT AGA TCT GCC AAT TTC ACA 864 
Gly Ser Leu Ala Glu Glu Glu Val Val Me Arg Ser Ala Asn Phe Thr 
275 280 285 

GAC AAT GCT AAA ACC ATA ATA GTA CAG CTG AAC CAA TCT GTA GAA ATT 912 
Asp Asn Ala Lys Thr He lie Val Gin Leu Asn Gin Ser Val Glu lie 
290 295 300 

AAT TGT ACA GGT GCT GGA CAT TGT AAC ATT AGT AGA GCA AAA TGG AAT 960 
Asn Cys Thr Gly Ala Gly His Cys Asn lie Ser Arg Ala Lys Trp Asn 
305 310 315 320 

GCC ACT TTA AAA CAG ATA GCT AGC AAA TTA AGA GAA CAA TTT GGA AAT 1008 
Ala Thr Leu Lys Gin lie Ala Ser Lys Leu Arg Glu Gtn Phe Gly Asn 
325 330 335 

AAT AAA ACA ATA ATC TTT AAG CAA TCC TCA GGA GGG GAC CCA GAA ATT 1056 
Asn Lys Thr lie He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu lie 
340 345 350 

GTA ACG CAC AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT TCA 1104 
val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser 
355 360 365 

ACA CAA CTG TTT AAT AGT ACT TGG TTT AAT AGT ACT TGG AGT ACT GAA 1152 
Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu 
370 375 380 

GGG TCA AAT AAC ACT GAA GGA AGT GAC ACA ATC ACA CTC CCA TGC AGA 1200 
Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu Pro Cys Arg 
385 390 395 400 

ATA AAA CAA TTT ATA AAC ATG GTG CAG GAA GTA GGA AAA GCA ATG TAT 1248 
He Lys Gin Phe He Asn Met Val Gin Glu Val Gly Lys Ale Met Tyr 
405 410 415 

GCC CCT CCC ATC AGC GGA CAA ATT AGA TGT TCA TCA AAT ATT ACA GGG 1296 
Ala Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly 
420 425 430 

CTG CTA TTA ACA AGA GAT GGT GGT AAT AAC AAC AAT CZZ TCC GAG ATC 1344 
Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He 
435 440 445 

TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA 1392 
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
450 455 460 
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TAT AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA G6A GTA GCA CCC ACC 1440 
Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr 
u6S ^70 475 480 

AAG GCA AAG AGA AG AGTGGTGCAG AGAGAAAAAT GAGCGGCCGC 1484 
Lys Ala Lys Arg 



(2) INFORMATION FOR SEO ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids 

(B) TYPE : amino acid 
CD) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

Cxi) SEOUENCE DESCRIPTION: SEO ID NO:22: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe Arg Arg 
20 25 30 

Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu 
85 90 95 

Asn Phe Asn Met Trp Lys Asn Asp Met Val Glu Gin Met His Glu Asp 
100 105 110 

He lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu 
145 150 155 160 

Lys Gly Glu He Lys Asn Cys Ser Phe Asn tie Ser Thr Ser lie Arg 
165 170 175 

Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp He He 
180 185 190 

Pro He Asp Asn Asp Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr 
195 200 205 

Ser Val He Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He Pro 
210 215 220 

He His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asn 
225 230 235 240 

Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin 
245 250 255 

Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
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Giy Ser Leu Ala GLu Glu GLu Val Val lie Arg Ser Ala Asn Phe Thr 
275 280 285 

Asp Asn Ala Lys Thr n e lie val Gin Leu Asn Gin Ser Val Glu He 
290 295 300 

Asn Cys Thr Gly Ala Gly His Cys Asn lie Ser Arg Ala Lys Trp Asn 
305 310 315 320 

Ala Thr Leu Lys Gin lie Ala Ser Lys Leu Arg Glu Gin Phe Gly Asn 
325 330 335 

Asn Lys Thr He He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He 
340 345 350 

Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser 
355 360 365 

Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu 
370 375 380 

Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu Pro Cys Arg 
385 390 395 400 

He Lys Gin Phe He Asn Met Val Gin Glu Val Gly Lys Ala Met Tyr 
405 410 415 

Ala Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly 
420 425 430 

Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He 
435 440 445 

Phe Arg Pro Gly Gly Gly Asp Net Arg Asp Asn Trp Arg Ser Glu Leu 
450 455 460 

Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr 
465 470 475 480 

Lys Ala Lys Arg 



(2) INFORMATION FOR SEO ID MO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: single 
CD) TOPOLOGY: linear 

(H) MOLECULE TYPE: DMA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.-1438 

(D) OTHER INFORMATION: 



(xi ) SEOUENCE DESCRIPTION: SEQ ID N0:23: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG CTG CTG CTG TGT GGA 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

GCA GTC TTC GTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 96 
Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe Arg Arg 



20 



25 



30 
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GGC GGC AGA GTA GAA AAG TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT 144 
Gly Gty Arg Val Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA 192 
Val Trp lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 240 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA 288 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
85 90 95 

CAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAG GAG GAT 336 
His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met *iln Glu Asp 
100 105 110 

ATA ATC AGT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC 384 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT ACT TTA AAT TGC AAG GAT GTG AAT GCT ACT AAT ACC 432 
Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 140 

ACT AAT GAT AGC GAG GGA ACG ATG GAG AGA GGA GAA ATA AAA AAC TGC 480 
Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly Glu lie Lys Asn Cys 
145 150 155 160 

TCT TTC AAT ATC ACC ACA AGC ATA AGA GAT GAG GTG CAG AAA GAA TAT 528 
Ser Phe Asn lie Thr Thr Ser lie Arg Asp Glu Val Gin Lys Glu Tyr 
165 170 175 

GCT CTT TTT TAT AAA CTT GAT GTA GTA CCA ATA GAT AAT AAT AAT ACC 576 
Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

AGC TAT AGG TTG ATA AGT TGT GAC ACC TCA GTC ATT ACA CAG GCC TGT 624 
Ser Tyr Arg Leu lie Ser Cys Asp Thr Ser Val He Thr Gin Ala Cys 
195 200 205 

CCA AAG ATA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC CCG GCT 672 
Pro Lys He Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala 
210 215 220 

GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG ACG TTC AAT GGA AAA GGA 720 
Gly Phe Ala tie Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

CCA TGT AAA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 768 
Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly tie Arg Pro 
245 250 255 

GTA GTA TCA ACT CAA CTG CTG CTA AAT GGC AGT CTA GCA GAA GAA GAG 816 
Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

GTA GTA ATT AGA TCT GAC AAT TTC ACC AAC AAT GCT AAA ACC ATA ATA 864 
Val Val lie Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He lie 
275 280 285 

GTA CAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ACA GGT GCT GGA CAT 912 
Val Gin Leu Lys Glu Ser Val Glu He Asn Cys Thr Gly Ala Gly His 
290 295 300 

TGT AAC ATT AGT AGA GCA AAA TGG AAT GAC ACT TTA AAA CAG ATA GTT 960 
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Cys Asn lie Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin lie Val 
305 310 315 320 

ATA AAA TTA AGA GAA CAA TTT GAG AAT AAA ACA ATA GTC TTT AAT CAC 1008 
He Lys Leu Arg Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His 
325 330 335 

TCC TCA GGA GGG GAC CCA -GAA ATT GTA ATG CAC AGT TTT AAT TGT GGA 1056 
Ser Ser Gly Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly 
340 345 350 

GGA GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG 1104 
Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp 
355 360 365 

AAT AAT AAT ACT GAA GGG TCA AAT AAC ACT GAA GGA AAT ACT ATC ACA 1152 
Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr lie Thr 
370 375 380 

CTC CCA TGC AGA ATA AAA CAA ATT ATA AAC ATG GTG CAG GAA GTA GGA 1200 
Leu Pro Cys Arg He Lys Gin lie lie Asn Met Val Gin Glu Val Gly 
385 390 395 400 

AAA GCA ATG TAT GCC CCT CCC ATC AGA GGA CAA ATT AGA TGT TCA TCA 1248 
Lys Ala Met Tyr Ala Pro Pro He Arg Gly Gin He Arg Cys Ser Ser 
405 410 415 

AAT ATT ACA GGG CTG CTA TTA ACA AGA GAT GGT GGT ATT AAT GAG AAT 1296 
Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly He Asn Glu Asn 
420 425 430 

GGG ACC GAG ATC TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG 1344 
Gly Thr Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp 
435 440 445 

AGA AGT GAA TTA TAT AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA 1392 
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly 
450 455 460 

GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAA AGA GAA AAA T 1438 
Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys 
465 470 475 

GAGCGGCCGC 1448 



(2) INFORMATION FOR SEQ ID NO; 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys' Val Leu Leu Leu Cys Gly 
15 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe Arg Arg 
20 25 30 

Gly Gly Arg Val Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 
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Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
85 90 95 

His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 
100 105 no 

He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 140 

Thr Asn Asp Ser Glu Gly Thr Het Glu Arg Gly Glu He Lys Asn Cys 
U5 150 155 160 

Ser Phe Asn He Thr Thr Ser lie Arg Asp Glu Val Gin Lys Glu Tyr 
165 170 175 

Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

Ser Tyr Arg Leu He Ser Cys Asp Thr Ser Val He Thr Gin Ala Cys 
195 200 205 

Pro Lys He Ser Phe Glu Pro lie Pro He His Tyr Cys Ala Pro Ala 
210 215 220 

Gly Phe Ala He Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 
245 2S0 255 

Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

Val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr lie lie 
275 280 285 

Val Gin Leu Lys Glu Ser Val Glu lie Asn Cys Thr Gly Ala Gly His 
290 295 300 

Cys Asn He Ser Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin He Val 
305 310 315 320 

He Lys Leu Arg Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His 
325 330 335 

Ser Ser Gly Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly 
340 345 350 

Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp 
355 360 365 

Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr He Thr 
370 375 380 

Leu Pro Cys Arg He Lys Gin lie He Asn Net Val Gin Glu Val Gly 
385 390 395 400 

Lys Ala Met Tyr Ala Pro Pro He Arg Gly Gin He Arg Cys Ser Ser 
405 410 415 

Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly He Asn Glu Asn 
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Gly Thr Glu Ite Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp 
435 440 445 

Arg Ser Glu Leu 7yr Lys Typ Lys Val Val Lys lie Glu Pro Leu Gly 
450 455 460 

val Ala Pro Thr Lys Ala Lvs Arg Arg Val Val Gin Arg Glu Lys 
465 470 475 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<ix> FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1567 

(D) OTHER INFORMATION: 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG CTG CTG CTG TGT GGA 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

GCA GTC TTC GTT TCG CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 96 
Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

GGC GCC AGA ACA GAA AAA TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT 144 
Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

GTG TGG AAG GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA 192 
Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 240 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GTA AAT GTG ACA GAA 288 
Pro Thr Asp Pro Asn Pro Gin Glu val Val Leu Val Asn Val Thr Glu 
85 90 95 

AAT TTT AAC ATG TGG AAA AAT GAC ATG GTA GAA CAG ATG CAT GAG GAT 336 
Asn Phe Asn Met Trp Lys Asn Asp Met Val Glu Gin Met His Glu Asp 
100 105 110 

ATA ATC AGT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC 384 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT AGT TTA AAG TGC ACT GAT TTG GGG AAT GCT ACT AAT 432 
Pr© Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

ACC AAT AGT AGT AAT ACC AAT AGT AGT AGC GGG GAA ATG ATG ATG GAG 480 
Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu 
145 150 155 160 

AAA GGA GAG ATA AAA AAC TGC TCT TTC AAT ATC AGC ACA AGC ATA AGA 528 

Lys Gly Glu lie Lys Asn Cys Ser Phe Asn lie Ser Thr Ser lie Arg 
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165 170 175 

GGT AAG GTG CAG AAA GAA TAT GCA TTT TTT TAT AAA CTT GAT ATA ATA 576 
Gly Lys Val GLn Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp lie lie 
180 185 190 

CCA ATA GAT AAT GAT ACT ACC AGC TAT ACG TTG ACA AGT TGT AAC ACC 624 
Pro lie Asp Asn Asp Thr Tnr Ser Tyr Thr Leu Thr Ser Cys Asn 7nr 
195 200 205 

TCA GTC ATT ACA CAG GCC TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC 672 
Ser val lie Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro lie Pro 
210 215 220 

ATA CAT TAT TGT GCC CCG GCT GGT TTT GCG ATT CTA AAA TGT AAT AAT 720 
lie His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asn 
225 230 235 240 

AAG ACG TTC AAT GGA ACA GGA CCA TGT ACA AAT GTC AGC ACA GTA CAA 768 
Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin 
245 250 255 

TGT ACA CAT GGA ATT AGG CCA GTA GTA TCA ACT CAA CTG CTG TTG AAT 816 
Cys Thr His Gly lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
260 265 270 

GGC AGT CTA GCA GAA GAA GAG GTA GTA ATT AGA TCT GCC AAT TTC ACA 864 
Gly Ser Leu Ala Glu Glu Glu Val Val lie Arg Ser Ala Asn Phe Thr 
275 280 285 

GAC AAT GCT AAA ACC ATA ATA GTA CAG CTG AAC CAA TCT GTA GAA ATT 912 
Asp Asn Ala Lys Thr lie lie Val Gin Leu Asn Gin Ser Val Glu lie 
290 295 300 

AAT TGT ACA AGA CCC AAC AAC AAT ACA AGA AAA AGT ATC CGT ATC CAG 960 
Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser lie Arg lie Gin 
305 310 315 320 

AGG GGA CCA GGG AGA GCA TTT GTT ACA ATA GGA AAA ATA GGA AAT ATG 1008 
Arg Gly Pro Gly Arg Ala Phe Val Thr lie Gly Lys lie Gly Asn Met 
325 330 335 

AGA CAA GCA CAT TGT AAC ATT AGT AGA GCA AAA TGG AAT GCC ACT TTA 1056 
Arg Gin Ala His Cys Asn He Ser Arg Ala Lys Trp Asn Ala Thr Leu 
340 345 350 

AAA CAG ATA GCT AGC AAA TTA AGA GAA CAA TTT GGA AAT AAT AAA ACA 1104 
Lys Gin lie Ala Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr 
355 360 365 

ATA ATC TTT AAG CAA TCC TCA GGA GGG GAC CCA GAA ATT GTA ACG CAC 1152 
lie He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He Val Thr His 
370 375 380 

AGT TTT AAT TGT GGA GGG GAA TTT TTC TAC TGT AAT TCA ACA CAA CTG 1200 
Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu 
385 390 395 400 

TTT AAT AGT ACT TGG TTT AAT AGT ACT TGG AGT ACT GAA GGG TCA AAT 1248 
Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn 
405 410 415 

AAC AHT GAA GGA AGT GAC ACA ATC ACA CTC CCA TGC AGA ATA AAA CAA 1296 
Asn Thr Glu Gly Ser Asp Thr He Thr Leu Pro Cys Arg He Lys Gin 
420 425 430 

TTT ATA AAC ATG GTG CAG GAA GTA GGA AAA GCA ATG TAT GCC CCT CCC 1344 
Phe He Asn Met Val Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
435 440 445 
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ATC AGC GGA CAA ATT AGA TGT TCA TCA AAT ATT ACA GGG CTG CTA TTA 1392 
lie Ser Gly Gin He Arg Cys Ser Sep Asn lie Thr Gly Leu Leu Leu 
450 455 460 

ACA AGA GAT GGT GGT AAT AAC AAC AAT GGG TCC GAG ATC TTC AGA CCT 1440 
Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He Phe Arg Pro 
„65 470 475 480 

GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT 1488 
Giy Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
485 490 495 

AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA AAG 1536 
Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys 
500 505 510 

AGA AGA GTG GTG CAG AGA GAA AAA TGA GCG G CCGC 1571 
Arg Arg val Val Gin Arg Glu Lys 
515 520 



(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 522 amino acids 

CB) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEOUENCE DESCRIPTION: SEO ID NO: 26: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

Gly Ala Arg Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Val Asn Val Thr Glu 
85 90 95 

Asn Phe Asn Met Trp Lys Asn Asp Mat Val Glu Gin Met His Glu Asp 
100 105 110 

lie He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Ser Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn 
130 135 140 

Thr Asn Ser Ser Asn Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu 
145 150 155 160 

Lys Gly Glu Us Lys Acr. Cys Ser Phe Asn lie Ser Thr Ser He Arg 
165 170 175 

Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp lie lie 
180 185 190 

Pro I le Asp Asn Asp Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr 
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195 200 205 



Ser Val lie Thr Gin Ala Cys Pro Lys Vat Ser Phe Gtu Pro lie Pro 
210 215 220 

He His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asn 
225 230 235 240 

Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin 
245 250 255 

Cys Thr His Gly lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn 
260 265 270 

Gly Ser Leu Ala Gtu Glu Glu Val Val He Arg Ser Ala Asn Phe Thr 
275 280 285 

Asp Asn Ala Lys Thr lie lie Vat Gin Leu Asn Gin Ser Val Glu lie 
290 295 300 

Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gin 
305 310 315 320 

Arg Gly Pro Gly Arg Ala Phe Val Thr He Gly Lys He Gly Asn Met 
325 330 335 

Arg Gin Ala His Cys Asn He Ser Arg Ala Lys Trp Asn Ala Thr Leu 
340 345 350 

Lys Gin He Ala Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr 
355 360 365 

He He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu lit Val Thr His 
370 375 380 

Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu 
385 390 395 400 

Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn 
405 410 415 

Asn Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys Arg He Lys Gin 
420 425 430 

Phe He Asn Met val Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 

435 440 445 

He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu 
450 455 460 

Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu He Phe Arg Pro 
465 470 475 480 

Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
485 490 495 

Lys val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys 
500 505 510 

Arg Arg Val Val Gin Arg Glu Lys 
515 520 

(2) INFORMATION FOR SEG ID N0:27: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

CA) NAME/KEY : CDS 

CB) LOCATION: 1 . .1522 
(0) OTHER INFORMATION: 



(xi) SEOUENCE DESCRIPTION: SEO ID NO:27: 

ATG GAT GCA ATG AAG AGA GGG CTC TGC TGT GTG CTG CTG CTG TGT GGA 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 

GCA GTC TTC GTT TCC CCC AGC CAG GAA ATC CAT GCC CGA TTC AGA AGA 
Ala Vat Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

GGC GGC AGA GTA GAA AAG TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT 
Gly Gly Arg Val Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
35 40 45 

GTG TGG AAA GAA GCA ACC ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA 
Val Trp Lys Glu Ala Tnr Thr Thr Leu Phe Cys Ala Ser Asp AU Lys 
50 55 60 

GCA TAT GAT ACA GAG GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA 
Ala Tyr Asp Thr Glu Val His Asn val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

CCC ACA GAC CCC AAC CCA CAA GAA GTA GTA TTG GAA AAT GTA ACA GAA 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
65 90 95 

CAT TTT AAC ATG TGG AAA AAT AAC ATG GTA GAA CAG ATG CAG GAG GAT 
His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 
100 105 110 

ATA ATC AGT TTA TGG GAT CAA AGC CTA AAG CCA TGT GTA AAA TTA ACC 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

CCA CTC TGT GTT ACT TTA AAT TGC AAG GAT CTG AAT GCT ACT AAT ACC 
Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 140 

ACT AAT GAT AGC GAG GGA ACG ATG GAG AGA GGA GAA ATA AAA AAC TGC 
Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly Glu lie Lys Asn Cys 
145 150 155 160 

TCT TTC AAT ATC ACC ACA AGC ATA AGA GAT GAG GTG CAG AAA GAA TAT 
Ser Phe Asn lie Thr Thr Ser lie Arg Asp Glu Val Gin Lys Glu Tyr 
165 170 175 

GCT CTT TTT TAT AAA CTT GAT GTA GTA CCA ATA GAT AAT AAT AAT ACC 
Ala Leu Phe Tyr Lys Leu Asp Val Val Pro lie Asp Asn Asn Asn Thr 
180 185 190 

AGC TAT AGG TTG ATA AGT TGT GAC ACC TCA GTC ATT ACA CAG GCC TGT 
Ser Tyr Arg Leu lie Ser Cys Asp Thr Ser Val lie Thr Gin Ala Cys 
195 200 205 

CCA AAG ATA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT GCC CCG GCT 
Pro Lys lie Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala 
210 215 220 

GGT TTT GCG ATT CTA AAG TGT AAT GAT AAG ACG TTC AAT GGA AAA GGA 
Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 
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CCA TGT AAA AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA ATT AGG CCA 768 
o ro cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro 
245 250 255 

GTA GTA TCA ACT CAA CTG CTG CTA AAT GGC AGT CTA GCA GAA GAA GAG 816 
Val VaL Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

GTA GTA ATT AGA TCT GAC AAT TTC ACG AAC AAT GCT AAA ACC ATA ATA 864 
Val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala tys Thr He He 
275 280 . 285 

GTA CAG CTG AAA GAA TCT GTA GAA ATT AAT TGT ACA AGA CCC AAC AAC 912 
Vai Gin Leu Lys Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn 
290 295 300 

AAT ACA AGA AAA AGT ATA CAT ATA GGA CCA GGG AGA GCA TTT TAT ACT 960 
Asn Thr Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr 
305 310 315 320 

ACA GGA GAA ATA ATA GGA GAT ATA AGA CAA GCA CAT TGT AAC ATT AGT 1008 
Thr Gly Glu He He Gly Asp He Arg Gin Ala His Cys Asn He Ser 
325 330 335 

AGA GCA AAA TGG AAT GAC ACT TTA AAA CAG ATA GTT ATA AAA TTA AGA 1056 
Arg Ala Lys Trp Asn Asp Thr Leu Lys Gin He Val He Lys Leu Arg 
340 345 350 

GAA CAA TTT GAG AAT AAA ACA ATA GTC TTT AAT CAC TCC TCA GGA GGG 1104 
Glu Gin Phe Glu Asn Lys Thr He Val Phe Asn His Ser Ser Gly Gly 
355 360 365 

GAC CCA GAA ATT GTA ATG CAC AGT TTT AAT TGT GGA GGA GAA TTT TTC 1152 
Asp Pro Glu He Val Net His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
370 375 380 

TAC TCT AAT TCA ACA CAA CTG TTT AAT AGT ACT TGG AAT AAT AAT ACT 1200 
Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr 
385 390 395 400 

GAA GGG TCA AAT AAC ACT GAA GGA AAT ACT ATC ACA CTC CCA TGC AGA 1248 
Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr He Thr Leu Pro Cys Arg 
405 410 415 

ATA AAA CAA ATT ATA AAC ATG GTG CAG GAA GTA GGA AAA GCA ATG TAT 1296 
He Lys Gin He He Asn Met Val Gin Glu Vat Gly Lys Ala Met Tyr 
420 425 430 

GCC CCT CCC ATC AGA GGA CAA ATT AGA TGT TCA TCA AAT ATT ACA GGG 1344 
Ala Pro Pro He Arg Gly Gin He Arg Cys Ser Ser Asn He Thr Gly 
435 440 445 

CTG CTA TTA ACA AGA GAT GGT GGT ATT AAT GAG AAT GGG ACC GAG ATC 1392 
Leu Leu Leu Thr Arg Asp Gly Gly lie Asn Glu Asn Gly Thr Glu He 
450 455 460 

TTC AGA CCT GGA GGA GGA GAT ATG AGG GAC AAT TGG AGA AGT GAA TTA 1440 
Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
465 470 475 480 

TAT AAA TAT AAA GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC 1488 
Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr 
485 490 495 

AAG GCA AAG AGA AGA GTG GTG CAA AGA GAA AAA T GAGCGGCCGC 1532 
Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys 
500 505 
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(2) INFORMATION FOR SEO 10 NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:28: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 

Ala Val Phe Val Ser Pro Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 30 

Gly Gly Arg val Glu Lys Leu Trp val Thr val Tyr Tyr Gly Val Pro 
35 40 45 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
50 55 60 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
65 70 75 80 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Glu Asn Val Thr Glu 
85 90 95 

His Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met Gin Glu Asp 
100 105 110 

He lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
115 120 125 

Pro Leu Cys Val Thr Leu Asn Cys Lys Asp Val Asn Ala Thr Asn Thr 
130 135 HO 

Thr Asn Asp Ser Glu Gly Thr Met Glu Arg Gly Glu lie Lys Asn Cys 
145 150 155 160 

Ser Phe Asn He Thr Thr Ser He Arg Asp Glu Vol Gin Lys Glu Tyr 
165 170 175 

Ala Leu Phe Tyr Lys Leu Asp Val Val Pro He Asp Asn Asn Asn Thr 
180 185 190 

Ser Tyr Arg Leu He Ser Cys Asp Thr Ser Val He Thr Gin Ala Cys 
195 200 205 

Pro Lys He Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala 
210 215 220 

Gly Phe Ala He Leu Lys Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly 
225 230 235 240 

Pro Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 
245 250 255 

Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 
260 265 270 

Val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He He 
275 280 285 

Val Gin Leu Lys Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn 
290 295 300 

Asn Thr Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr 
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3C5 



315 



320 



Tnr jVy G;u lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser 
325 33C 335 

A^g Ala Lys Tro Asn Asp Thr Leu Lys Gin lie Val lie Lys Leu Arg 
340 345 350 

Ziu utn Phe Glu Asn lys Thr lie val Pne Asn Mis Ser Ser Gly Gly 
355 360 365 

Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
370 375 3S0 

Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr 
385 390 395 400 

Glu Gly Ser Asn Asn Thr Glu Gly Asn Thr He Thr Leu Pro Cys Arg 
405 410 415 

He Lys Gin He He Asn Met Val Gin Glu Val Gly Lys Ala Met Tyr 
420 425 430 

Ala Pro Pro He Arg Gly Gin He Arg Cys Ser Ser Asn He Thr Gly 
435 440 4*5 

Leu Leu Leu Thr Arg Asp Gly Gly He Asn Glu Asn Gly Thr Glu He 
450 455 460 

Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
465 470 475 480 

Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro Thr 
485 490 495 

Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys 



(2) INFORMATION FOR SEO ID N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 

Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg 
15 10 15 



500 



505 
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What is claimed is: 

1. A recombinant nucleic acid molecule which encodes a 
mutant HIV-1 gpl2 0 envelope glycoprotein comprising a 

5 V3 loop deletion and a C4 domain fW _ >X) point mutation, 

wherein X is an amino acid residue other than 
tryptophan. 

2. The recombinant nucleic acid molecule of claim l, 
10 wherein X is a valine residue. 

3. The recombinant nucleic acid molecule of claim l # 
wherein the nucleic acid molecule is a DNA molecule . 

15 4 . The recombinant nucleic acid molecule of claim 3 , 
wherein the DNA molecule is a plasmid. 

5. The recombinant nucleic acid molecule of claim 4 f 
wherein the plasmid comprises the sequence of the 

20 plasmid designated PPI4-tPA. 

6. The recombinant nucleic acid molecule of claim l, 
wherein the C4 domain is an HIV-l^ gpl20 envelope 
glycoprotein C4 domain. 

25 

7. The recombinant nucleic acid molecule of claim 6, 
wherein the mutant HIV-l gp!20 envelope glycoprotein is 
a mutant HIV-i^ gpl20 envelope glycoprotein. 

30 8. The recombinant nucleic acid molecule of claim 1, 
wherein the C4 domain is an HIV-1jr.pl gpl20 envelope 
glycoprotein C4 domain. 



9. 



The recombinant nucleic acid molecule of claim S, 
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wherein the mutant HIV-i gpi20 envelope glycoprotein is 
a mutant HIV-1 jr _fl gpi20 envelope glycoprotein. 

10. The mutant HIV-1 gpl20 envelope glycoprotein encoded by 
5 the recombinant nucleic acid molecule of claim 1. 

11. A vaccine which comprises a therapeutically effective 
amount of the mutant HIV-l gpl20 envelope glycoprotein 
of claim 10, and an adjuvant. 

10 

12. A method of treating an HIV-1 -infected subject, which 
comprises immunizing the HIV-1 -infected subject with 
the vaccine of claim 11, thereby treating the HIV-i- 
infected subject. 

15 

13 . A vaccine which comprises a prophylactically effective 
amount of the mutant HIV-1 gpl20 envelope glycoprotein 
of claim 10, and an adjuvant. 

20 14. A method of reducing the likelihood of an HIV-l-exposed 
subject's becoming infected with HIV-l, which comprises 
immunizing the HIV-l-exposed subject with the vaccine 
of claim 13, thereby reducing the likelihood of the 
HIV-l-exposed subject's becoming infected with HIV-l. 



25 



30 



15. A method of reducing the likelihood of a non-HIV-1- 
exposed subject's becoming infected with HIV-l, which 
comprises immunizing the non - HIV- 1- exposed subject with 
the vaccine of claim 13 , thereby reducing the 
likelihood of the non -HIV-1 -exposed subject's becoming 
infected with HIV-l. 



16. 



A method of obtaining partially purified antibodies 
which specifically bind to the CD4 -binding domain of 
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HIV-l gpl20 envelope glycoprotein, which method 
comprises (a) immunizing a non- HIV- 1 -exposed subject 
with the vaccine of claim 13, (b) recovering from the 
immunized subject serum comprising said antibodies, and 
5 (c) partially purifying said antibodies, thereby 

obtaining partially purified antibodies which 
specifically bind to the CD4 - binding domain of HIV-l 
gpl20 envelope glycoprotein. 

10 17. The method of claim 16, wherein the subject is a human. 

18. The partially purified antibodies produced by the 
method of claim 16 . 

15 19. A pharmaceutical composition, which comprises a 
therapeutically effective amount of the partially 
purified antibodies of claim 18, and a pharmaceutical ly 
acceptable carrier . 



20 20. A method of treating an HIV-l -infected subject, which 
comprises administering to the subject a dose of the 
pharmaceutical composition of claim 19 effective to 
reduce the population of HIV-l -infected cells in the 
HIV-l -infected subject, thereby treating the HIV-l- 

25 infected subject. 



21. A method of treating an HIV-l- infected subject, which 
comprises administering to the subject a dose of the 
pharmaceutical composition of claim 19 effective to 
30 reduce the population of HIV-l in the HIV-l- infected 

subject, thereby treating the HIV-l -infected subject. 



35 



22. 



A composition which comprises a prophylactically 
effective amount of the partially purified antibodies 
of claim 18, and a pharmaceutically acceptable carrier. 
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23. A method of reducing the likelihood of an HIV-i-exposed 
subject's becoming infected with HlV-l, which comprises 
administering to the HIV- 1 - exposed subject a dose of 
5 the composition of claim 22 effective to reduce -the 

population of HIV-1 in the HIV- 1 -exposed subject, 
thereby reducing the likelihood of the subject's 
becoming infected with HIV-1. 

10 24. The method of claim 23 , wherein the subject is a 
medical practitioner. 

25. The method of claim 23, wherein the subject is a 
newborn infant. 

15 

26. A method of reducing the likelihood of a non-HIV-l- 
exposed subject's becoming infected with HIV-l as a 
result of exposure thereto during an incident wherein 
there is an increased risk of exposure to HIV-1, which 

20 comprises administering to the subject immediately 

prior to the incident a dose of the composition of 
claim 22 effective to reduce the population of HIV-1 to 
which the subject is exposed during the incident, 
thereby reducing the likelihood of the subject's 

25 becoming infected with HIV-1. 

27. The method of claim 26, wherein the subject is a 
medical practitioner- 



30 
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FIGURE 1 
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FIGURE 2 
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